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Abstract 

We consider a system of particles which perform branching Brownian motion with negative 
drift and are killed upon reaching zero, in the near-critical regime where the total population 
stays roughly constant with approximately N particles. We show that the characteristic 
time scale for the evolution of this population is of order (log TV) 3 , in the sense that when 
time is measured in these units, the scaled number of particles converges to a variant of 
Neveu's continuous-state branching process. Furthermore, the genealogy of the particles is 
then governed by a coalescent process known as the Bolthausen-Sznitman coalescent. This 
validates the non-rigorous predictions by Brunet, Derrida, Muller, and Munier for a closely 
related model. 
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1 Introduction 



Branching Brownian motion is a stochastic process in which, at time zero, there is a single 
particle at the origin. Each particle moves according to a standard Brownian motion for an 
exponentially distributed time with mean one, at which point it splits into two particles. Early 
work on branching Brownian motion, going back to McKean [S3], focused on the position M(t) of 
the right-most particle. Bramson |16|. [T7] obtained asymptotics for the median of the distribution 
of M(t), and Lalley and Sellke |47| found the asymptotic distribution of M(t). 

In 1978, Kesten [43] introduced branching Brownian motion with absorption. This process 
follows the same dynamics as branching Brownian motion except that the initial particle is 
located at x > 0, the Brownian particles have a drift of — fj,, where fi > 0, and particles are killed 
when they reach the origin. Kesten showed that there exists a critical value fi c = y/2 such that 
if /i > fi c , then the process dies out almost surely, while if /x < fi c , the process survives with 
positive probability. More recent work on this process can be found in |39| and [40j . 

Our interest in branching Brownian motion with absorption comes from its possible interpre- 
tation as a model of a population undergoing selection. To see this connection, imagine that each 
individual in a population is represented by a position on the real line, which measures her fitness. 
The fitness of an individual evolves according to Brownian motion due to mutations, and initially 
the fitness of a child is identical to the fitness of the parent. Selection progressively eliminates all 
individuals whose fitness becomes too low; we effectively imagine selection as a moving wall with 
constant speed fi. Every individual whose fitness falls beyond the current threshold is instantly 
removed from the population. 

To obtain asymptotic results as the population size tends to infinity, we consider a sequence of 
branching Brownian motions with absorption. For each positive integer N, we have a branching 
Brownian motion with absorption (X/v(i),i > 0). We consider the near-critical case, where the 
drift n depends on N and for N > 2, 



We also start the process with many particles, rather than just one, at time zero, and we make 
some rather technical assumptions on the initial conditions, which are given later in Proposition 
[TJ While (HJ) and the initial conditions may seem unnatural, they are necessary to ensure that 
the number of particles in the system stays of order N on the time scale of interest, so that the 
process can be viewed as a model of a population of size approximately N. 

We focus on understanding the genealogy of a sample from the population after a large time. 
We show that the time to the most recent common ancestor of a sample behaves like (logiV) 3 . 
Moreover we identify the limiting geometry of the coalescence tree of a sample, which we show 
is governed by a coalescent process (H(t),t > 0) known as the Bolthausen-Sznitman coalescent. 
The Bolthausen-Sznitman coalescent, which is defined precisely in subsection 11.31 is a coalescent 
process that allows many ancestral lines to merge at once. This result is in sharp contrast with 
the standard case of the Moran model or the Wright-Fisher model, where random genetic drift 
leads to a characteristic genealogical time of N generations and a genealogical tree given by 
Kingman's coalescent, which permits only pairwise mergers of ancestral lines. 

The main result of this paper can thus be stated as follows. Fix t > 0. Choose n particles 
uniformly at random from the population at time (log N) s t, and label these particles at random 
by the integers 1, . . . , n. For < s < 2nt, define IIw(s) to be the partition of {1, . . . , n} such that 
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i and j are in the same block of Hn{s) if and only if the particles labeled i and j are descended 
from the same ancestor at time (t — s/27r)(log AQ 3 . This is the standard "ancestral partition" 
of the sample. Then, with our initial conditions, we have the following result, which is stated 
precisely later as Theorem [3[ 

Main Result. The sequence of processes (I1jv(s),0 < s < 2irt) converges in the sense of finite- 
dimensional distributions as N — > oo to the Bolthausen-Sznitman coalescent (n(s),0 < s < 2irt). 

The reason for the multiple mergers is that when a particle gets very far to the right (in fact, 
at position ^5 (log N + 3 log log N + O(l))), many descendants of this particle survive for a long 
time, as they are able to avoid being killed at zero. They quickly generate a positive fraction 
of the population. As a result, when a sample of particles is taken far into the future, many of 
their ancestral lines get traced back to this particle and coalesce at nearly the same time. Our 
result is accompanied by Theorem [21 which gives the evolution of the total number of particles 
Mjv(t) in the system. Under the same assumptions, Mjv((log N) 3 t)/(2irN) converges in the sense 
of finite-dimensional distributions towards a continuous-state branching process with branching 
mechanism &(u) = au + 27r 2 ulogu for some constant a € R. 



1.1 Related models and conjectures 

Our inspiration for this model comes from the work of Brunet, Derrida, Mueller, and Munier [211 
22j concerning the effect of natural selection on the genealogy of a population. They considered a 
model of a population with fixed size A" in which each individual has a fitness. They assumed that 
each individual has k > 2 offspring in the next generation, and that the fitness of each offspring 
is the parent's fitness plus an independent random variable with some distribution \i. Of the 
kN offspring, the A" with the highest fitness survive to form the next generation. This process 
repeats itself in each generation. Brunet et. al. [21} [22] gave a detailed and intricate, but not 
mathematically rigorous, analysis of this model and arrived at the following three conjectures: 

1. If L m is the maximum of the fitnesses of the A^ individuals in generation m, then L m /m con- 
verges almost surely to some limiting velocity vjy. Furthermore, the limit Voq = limjv-»oo vn 
exists, and there is a constant C such that 

C 

Voo-V N ~- — . (2) 

(log N) z 

2. If two individuals are sampled from the population at random in some generation, then the 
number of generations that we need to look back to find their most recent common ancestor 
is of order (logA^) 3 . 

3. If n individuals are sampled from the population at random in some generation and their 
ancestral lines are traced backwards in time, the coalescence of these lineages can be de- 
scribed by the Bolthausen-Sznitman coalescent. 

This model is similar to a branching random walk in which the positions of the particles 
correspond to the fitnesses of the individuals. Indeed, this model would be precisely a branching 
random walk if all individuals were permitted to survive. The limiting velocity Voo that appears 
in the first conjecture is the limiting velocity of the right-most particle in branching random walk, 
which was studied in the 1970s by Kingman [33], Hammersley [37], and Biggins [lOj. Interest 
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in variations of the branching random walk in which the number of particles stays fixed is more 
recent. Berard and Gouere [5] recently proved the first conjecture in the form stated above, in 
the case k = 2, under suitable regularity conditions on fj,. Their proof builds on previous work 
of Gantert, Hu, and Shi [34J and Pemantle |58j . See also the work of Durrett and Mayberry 
[29] . who considered a model very similar to this one while studying predator-prey systems, and 
Durrett and Remenik [30] . 

The analysis of Brunet et. al. involves studying solutions u(x, t) to the noisy FKPP equation 



du d u 2 u(l — u 

m = d^ + u ~ u + V^v 



2 + u-u A + \j \ T ' W(x,t), (3) 



where W(x, t) is space-time white noise. If the noise term were removed, this partial differential 
equation would be the well-known FKPP equation, which was introduced in 1937 by Fisher [33] 
and by Kolmogorov, Petrovskii, and Piscunov [46] and is one of the simplest nonlinear partial 
differential equations that admits traveling wave solutions. The link between the FKPP equation 
and branching Brownian motion has been known since the work of McKean [54j , who showed that 
if M(t) denotes the position of the right-most particle at time t for branching Brownian motion 
with variance parameter 2 and u(t, x) = P(M{t) > x), then u is the unique solution to the FKPP 
equation with the initial condition u(0,x) = ls x <0}- In Harris, Harris, and Kyprianou use 
branching Brownian motion with absorption to give a probabilistic analysis of solutions to the 
FKPP equation. 

The first conjecture above can also be expressed as a conjecture about the velocity of solutions 
to equations such as (|3]). This form of the conjecture goes back to the work [18] of Brunet and 
Derrida, who refined their analysis and simulations in |19U20j . In this form, the conjecture states 
that the velocity of traveling wave solutions to the original FKPP equation exceeds the velocity 
of solutions to the equation ([3|) with the noise term by a quantity that is of the order l/(log N) 2 . 
Recently Mueller, Mytnik, and Quastel [55] proved this result. 

With the first conjecture having been largely settled, the purpose of the present paper is to 
provide rigorous versions of the second and third conjectures. As explained above, the model 
that we work with is not exactly the model studied in [21} I22j. Instead, to simplify the analysis, 
we replace branching random walk by branching Brownian motion, and rather than keeping the 
population size exactly fixed, we control the population size by killing particles that drift too far 
to the left. Note in particular that with our choice ([1]), 
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as N — > oo, which matches precisely ([2]) in Conjecture 1 above. Models with nonconstant 
population size where already discussed by Derrida and Simon in [25} [65] using nonrigorous 
methods. Although they do not study genealogies, their analysis strongly suggests that a result 
similar to ours may be expected. 

Another question of interest related to these nearly critical branching particle systems (in 
the sense that the drift fi of particles is slightly above the critical value \i c = v2) ; concerns 
asymptotics for the survival probability. This is a topic that has attracted a considerable amount 
of attention in recent years; see, for example, [H EJ EJ [32l [Ml HOI HI] • In [3], we use the techniques 
developed here to derive fairly sharp estimates for the survival probability of nearly critical 
branching Brownian motion. 
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We also emphasize that the Bolthausen-Sznitman coalescent describes precisely the ultramet- 
ric structure that is expected to emerge in the low-temperature regime of mean-field spin glass 
models such as the well-known Sherrington-Kirkpatrick model. This is perhaps not a coincidence, 
as the model which we study here may be seen as a degenerate form of spin glass models, with 
the position of the particles being approximately given by a Gaussian field with a covariance 
structure which is closely related to their genealogy. 

1.2 Continuous-state branching processes 

A continuous-state branching process is a [0, oo]-valued Markov process (Z(t),t > 0) whose 
transition functions pt(x,-) satisfy 

p t (x +y,-)= p t (x, •) * p t (y, •) for all x, y > 0. 

That is, the sum of independent copies of the process started from x and y has the same law 
as the process started from x + y. Continuous-state branching processes were introduced by 
Jirina |42j . Lamperti [H] showed that continuous-state branching processes are precisely the 
processes that can be obtained by taking scaling limits of Galton- Watson processes. Lamperti 
[19] and Silverstein [M] observed a one-to-one correspondence between continuous-state branching 
processes and Levy processes with no negative jumps, by showing that it is possible to obtain 
any continuous-state branching process through a time change of the corresponding Levy process 
(see also |23| for a very readable account of this theory and proofs). 

If we exclude processes that can make an instantaneous jump to infinity, continuous-state 
branching processes can be characterized by a function ^> : [0, oo) — > K of the form 

roc 

^(u) = au + f3u 2 + / (e~ ux - 1 + uxli x <W) v{dx), 
Jo 

where a G R, (3 > 0, and v is a measure on (0, oo) satisfying / °°(1 A x 2 ) v(dx) < oo. The function 
^ is called the branching mechanism. If (Z(t),t > 0) is a continuous-state branching process 
with branching mechanism ^, then for A > 0, 

E[e- xz W\Z(0) = a} = e ~ aUti - x \ (5) 

where the function 1 1— >■ ut(X) is a solution to the differential equation 

^Ut(A) = -*(ut(A)), n (A) = A. (6) 

Neveu [57] studied the continuous-state branching process with = it log it. We will be 

interested, more generally, in a continuous-state branching process (Z(t),t > 0) whose branching 
mechanism is of the form ^(u) = au + bit log it, where a G R and b > 0. In this case (see, for 
example, p. 256 of [7J), there exists a real number c such that 

POO 

= -cu + b (e~ ux - 1 + uxli x <n) x~ 2 dx. 
Jo 

Also, it is not difficult to solve © to obtain 

Ut (A) = \ e ~ bt e a ( e ~ bt ~ 1 ^ b . (7) 

Because l/\J/(it) du = oo for all 5 > 0, the process does not explode. That is, almost surely 
Z(t) < oo for all t. Because J s l/^(u) = oo for all 5 > 0, the process does not go extinct. That 
is, almost surely Z(t) > for all t. Proofs of these facts can be found in [36J. 
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1.3 The Bolthausen-Sznitman coalescent 

In mathematical population genetics, it is standard to represent the ancestral relationships among 
a sample of n individuals using a coalescent process (U(t),t > 0), which is a continuous-time 
Markov process taking its values in the set of partitions of {1, . . . , n}. Here 11(0) is the partition 
of {1, . . . , n} into n singletons, and blocks of the partition merge over time. The merging of blocks 
of the partition corresponds to the merging of ancestral lines when the ancestral lines of the n 
sampled individuals are traced backwards in time. The standard coalescent model is Kingman's 
coalescent. Kingman's coalecent was introduced in [45] and is now the basis for much work in 
mathematical population genetics. Kingman's coalescent has the property that only two blocks 
of the partition ever merge at a time, and each transition that involves two blocks merging into 
one happens at rate one. 

Within the last decade, alternative models of coalescence, allowing for multiple ancestral 
lines to merge at once, have been studied in some depth. These coalescent processes, known as 
coalescents with multiple mergers or A-coalescents, were introduced by Pitman [59] and Sagitov 
|62j . If A is a finite measure on [0, 1], then the A-coalescent has the property that whenever there 
are b blocks, each transition that involves merging k blocks of the partition into one happens at 
rate 



Kingman's coalescent is the special case of the A-coalescent in which A is the unit mass at zero. 

If A is the uniform distribution on [0, 1], then the A-coalescent is known as the Bolthausen- 
Sznitman coalescent. The Bolthausen-Sznitman coalescent was introduced in [14] in the context of 
Ruelle's probability cascades. The Bolthausen-Sznitman coalescent has been studied extensively, 
and has been found to be related to stable subordinators [9] and random recursive trees [35] . It 
also shows up in Derrida's generalized random energy model [15j . Properties of the Bolthausen- 
Sznitman coalescent have been worked out, for example, in [59\ [27] [2]. 

Bertoin and Le Gall [7] showed how to define precisely the notion of the genealogy of a 
continuous-state branching process. They found that the genealogy of Neveu's continuous-state 
branching process is given by the Bolthausen-Sznitman coalescent. These results were extended 
in [13] , where it was shown that the genealogy of any continuous-state branching process whose 
branching mechanism is of the form ^(u) = au + bu log u can still be described by the Bolthausen- 
Sznitman coalescent. This connection between the Bolthausen-Sznitman coalescent and Neveu's 
continuous-state branching process played a central role in Bovier and Kurkova's analysis of 
Derrida's generalized random energy model [15]. A survey of this material can be found in [1]. 

1.4 Main results 

Recall that for each positive integer N, we have a branching Brownian motion (Ajv(i),i > 0). 
We denote by Mjy(i) the number of particles at time t, and we denote the positions of these 
particles by ATi^i) > A^A^i) > • • • > X MN ^ t ^ N {t). We further define the process (Z^(t),t > 0) 
by setting 





(8) 



and then letting 




(9) 
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Note that only particles to the left of L contribute to Zn(£), and the level L depends on N. As 
we will see later, Z^{t) is a good measure of the "size" of the process at time t, in the sense that 
it predicts the number of particles shortly after time t. Also let 

M N (t) 

Y N {t)= e^ N{t) . (10) 
i=i 

We will see that as the branching Brownian motion evolves, most particles stay well to the left 
of L, and as long as this is the case, the number of particles changes little. However, occasionally a 
small number of particles get very far to the right. Because the descendants of these particles are 
able to avoid the barrier at zero, the number of particles increases rapidly. Indeed, the increase 
in the number of particles is so rapid that when we take the scaling limit as N — > oo, we get a 
process with jumps. The proposition below shows that this limiting process is a continuous-state 
branching process. 

Proposition 1. For all positive integers N, define the process (V]y(t),t > 0) by 

V N (t) = m ^ 2 Z N ((lo g N)H). (11) 

Suppose as N — ^ oo, the distribution o/Vjv(0) converges to v, where v is a probability distribution 
on [0, oo). Suppose also that Y/v(0)/A(log A) 3 converges to zero in probability dsJV-f oo. Then 
there exists a constant a £ K such that as N — > oo, the finite- dimensional distributions of the 
process (V]\i(t),t > 0) converge to the finite- dimensional distributions of the continuous-state 
branching process with branching mechanism ^(u) = au + 2TT 2 ulogu started with distribution v 
at time zero. 

The condition on V/v(0) ensures that the number of particles in the system is of order N, 
as shown below with the scaling in Theorem [2j The condition on Y/y (0) ensures that no single 
particle at time is likely to have descendants that constitute a large fraction of the population 
a short time later. If we begin with A particles in what is a relatively "stable" configuration, 
then the initial conditions will hold. Furthermore, as shown in Proposition 3 of [3j, if there is 
initially a single particle near L, then these conditions will be satisfied after a time of order L? . 

Note that because the processes (VN(t),t > 0) for fixed A can increase very rapidly in a 
short time but do not have large jumps, the sequence of processes (Vat, A > 1) is not tight, and 
convergence in the Skorohod topology does not hold. 

The theorem below converts this result about the scaling limit of (Z]y(t),t > 0) to a result 
about the number of particles. This convergence result holds only for t > 0. The hypothesis at 
time t = still involves the processes (VN(t),t > 0), which may not imply convergence of the 
number of particles at time zero. The result needs to be stated in this way because it is the value 
of Zpf(t) rather than M^{t) that predicts the number of particles that will be alive a short time 
later. 

Theorem 2. Assume the hypotheses of Proposition [7] hold. Then as N — > oo, the finite- 
dimensional distributions of the process 

' M N ((logN)H),t >0 



2vrA 
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converge to the finite-dimensional distributions of the continuous- state branching process with 
branching mechanism ty(u) = au + 2tt 2 u log u started with distribution v at time zero, where a is 
the constant from Proposition d 

The next result shows that if we pick n particles at random from branching Brownian motion 
with absorption at some time and trace back their ancestral lines, the resulting process, properly 
scaled, converges to the Bolthausen-Sznitman coalescent. This is a precise formulation of the 
result stated in the introduction. Choose n particles uniformly at random from the M7\r((log iV) 3 t) 
particles at time (log N) 3 t, and label these particles at random by the integers 1, . . . , n. Fix t > 0. 
For < s < 2irt, define II v(s) to be the partition of {1, . . . , n} such that i and j are in the same 
block of IIjv(s) if and only if the particles labeled i and j are descended from the same ancestor 
at time (t — s/27r)(log N) 3 . Let (n(s),0 < s < 2irt) be the Bolthausen-Sznitman coalescent run 
for time 2nt and restricted to {1, . . . , n}. 

Theorem 3. Assume the hypotheses of Proposition [7] hold, and assume that ^({0}) = 0. Then 
as N — ^ oo, the finite- dimensional distributions of (IIv(s),0 < s < 2-irb) converge to those of 
(n(s),0 < s < 2vrt). 

As discussed earlier, this result is, of course, the analog for this model of the third conjecture 
of Brunet et. al. |21| 122] stated above. The (logiV) 3 time scaling that appears here, as well as in 
Proposition [T] and Theorem O matches the second conjecture stated above. If two particles are 
chosen at random, the time back to their most recent common ancestor is of the order (logiV) 3 . 

1.5 Overview of the proofs 

Because the proofs of Proposition [TJ and Theorems [2] and [3] are rather long, we outline the basic 
strategy here. The key idea is to treat separately the particles that reach approximately the level 
L. These are the particles that will produce a large number of descendants within a short time, 
leading to jumps in the population size when we look forwards in time and multiple mergers of 
ancestral lines going backwards in time. 

The first step, carried out in Section 2, is to collect some results that we need pertaining to 
branching Brownian motion in a strip, which are important both for the proofs and for under- 
standing the heuristics behind our choices of parameters. Most importantly, we observe that if 
a branching Brownian motion is started with a single particle at x and particles are killed upon 
reaching or L, then the expected number of particles in a set B at a sufficiently large time t is 
approximately f B pt(x,y) dy, where 

p t (x, y) = \ e^/^/^t . ^ gin f*x\ e ., y g . n (12) 

From this formula, we can make several observations concerning the behavior of the branching 
Brownian motion. First, note that the time parameter t appears in the formula only in the first 
exponential factor, so the population size should be roughly constant over time provided that 
1 — li 2 /2 — 7r 2 /2L 2 = 0. Indeed, we have chosen the parameters \i and L above (see ([I]) and (JSJ) 
to satisfy this equation, as this is the drift needed to stabilize the population size. Second, notice 
that the formula is proportional to eP® sin(ivx/L), which will equal Z^{t) if we sum over the 
positions of all particles at time t. Thus, it is Zn(£) that predicts the number of particles that 
will be in a given set at a later time, which is why Z^{t) provides a useful measure of the "size" 
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of the process. Third, notice that the formula is proportional to sm(iry/L). Consequently, 
regardless of the starting configuration, once t is large enough for the approximation to be valid, 
the particles will have settled into a "stable" configuration in which the "density" of particles at 
position y is proportional to e~^ y s'm(iry/L). We will see in Lemma [5] that this approximation 
becomes accurate when t gets to be larger than (log iV) 2 . 

If we begin at time zero with N particles that are approximately in the stable configuration, 
so that their "density" is CLe~ m s'm(iry/L), where CL is a normalizing constant, then the value 
of Zn(0) should be approximately 



which is of the order NL 2 . On the other hand, if we begin instead with a single particle at L, then 
one can show typically the right-most descendant of this particle will reach a level that exceeds L 
by only a constant. This is essentially true because critical branching Brownian motion dies out, 
and can be seen from Proposition [16] below which shows that particles reach L at a much faster 
rate than they reach any level that is much greater than L. Consequently, we can estimate the 
typical contribution of the descendants of this particle at time t by using (]12p with L in place of 
x and L + a in place of L, where a > is a constant. This means that the value of Z^{t) should 
be of the same order as 



which is of the order L~ 1 e llL . We have chosen L so that particles that reach L produce substantial 
increases in the population size. Indeed, note that L~ l e flL and NL 2 are of the same order precisely 
when L is within a constant of the value in ([Sj) . 
In Section 3, we therefore define 



where A £ R, and study the particles that stay to the left of La- That is, we consider branching 
Brownian motion with particles killed at and at La- Using (|12p . it possible to estimate first and 
second moments of various quantities. In subsection 13 .1\ we apply these results to calculate the 
first and second moments of Zjv(i), conditional on the process a time #(logiV) 3 earlier, where 9 is 
a small constant. The first moment calculation is Lemma [TTl while the variance bound appears 
in Lemma [T2l The variance bound is sufficient to establish that when A is large, there is a Law of 
Large Numbers, with the value of Z^(t) being close to its expectation. A similar variance bound 
for the number of particles is given in Lemma PHI Such results would not be possible without the 
truncation at La, because without truncation the expected number of particles is dominated by 
rare events in which one particle moves far to the right and produces a large number of surviving 
offspring. The analysis in Section 3 is motivated by some of the arguments based on moment 
bounds in [43] . 

In subsection 13.21 we tackle the question of how many particles reach the level La- An estimate 
of the expected number is given in Proposition [16] From this result, one can deduce that if we 
start with N particles that are in approximately the "stable" configuration described above, then 
the time that it will take before a particle reaches La is of the order (log N) 3 , which explains the 






(13) 
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(log iV) 3 time scaling in our main results. To see heuristically why this scaling occurs, note that 
if f3 > is a constant, then the number of particles between L — f3 and L at time t is of the order 



which is of the order l/(log N) 3 . Such particles have a positive probability of reaching L between 
times t and t + 1, but the calculation in Proposition [16] shows that particles that are more than 
a constant distance from L at time t are unlikely to hit L by time t + 1. Thus, 0(1/ (log N) 3 ) 
particles hit L per unit time. 

Since branching by particles close to La may enable several particles to hit La at nearly the 
same time, we also require the second moment estimate in Proposition [18] to establish that the 
expected number of particles that reach La within a time interval of length #(log iV) 3 , conditional 
on at least one particle reaching La, is bounded by a constant. Then in subsection 13.3^ we show 
in Proposition [23] that a "good" event on which the bounds in subsections 13.11 and 13.21 are valid 
occurs with high probability. 

In Section [U we begin to consider the contribution from particles after they reach the level 
La- The key to this analysis is Proposition 1241 which comes from [56J. This result states that if 
a particle starts at La and y is a large constant, then the number of descendants of the particle 
that reach La — y is approximately y~ l e^ y W , where W is a random variable. Some analysis 
that involves a Tauberian theorem leads to Proposition 1271 which says that for large x, we have 
P(W > x) ~ B/x. Conceptually, this result is the reason why the genealogy of the population 
is described by the Bolthausen-Sznitman coalescent. The contribution to the population of the 
particle at La will be approximately proportional to the number of descendants that hit y, if y is 
sufficiently large. The fact that a jump of size greater than x results from a particle at La with 
probability proportional to 1/x implies that the Levy measure of the limiting continuous-state 
branching process will have a density proportional to x~ 2 , which in turn leads to the duality with 
the Bolthausen-Sznitman coalescent. 

In Section [5] we show how to combine all of the previous estimates to get sharp results for 
the behavior of the process (Z]\f(t),t > 0). The key results are Proposition 1391 which bounds the 
expected change in Zjv over a time interval of length #(log iV) 3 when there is no large jump, and 
Proposition 141} which estimates the probability that Zjv increases by at least rN(logN) 2 over 
a time interval of length #(logiV) 3 . These estimates on how the process behaves over a short 
time interval can be matched with the infinitesimal generator of the continuous-state branching 
process. This work is done in Section [6] and leads to a proof of Proposition [T] Once Proposition 
Q]is established, we are able to prove Theorem [2] by arguing that the value of Zj^(t) can be used 
to predict accurately the number of particles shortly after time t. 

The proof that the genealogy of the process converges to the Bolthausen-Sznitman coalescent 
is completed in Section [7] We represent the genealogy of the branching Brownian motion using 
a "flow of bridges", a tool introduced by Bertoin and Le Gall in [8]. Using Proposition [T] and 
Theorem [2] we establish convergence to the flow of bridges associated with the continuous-state 
branching process, which is known to correspond to the Bolthausen-Sznitman coalescent. 

1.6 Notational conventions and index of notation 

For the benefit of the reader, we include in the table below an index of some of the notation that 
is used throughout the paper. 
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A used to control the level at which particles are killed; see the definition of La- 

Gn.u event that Z^{tj) and Y^{tj) are sufficiently small for j < k. 

Gn(e) event that G^^ occurs for all k. 

h(N) slowly increasing function used to upper bound Y^r. 

L level, given by ([8]), such that descendants of a particle that get near this level 

will likely constitute a significant fraction of the population in the future. 
La level at which particles are killed, defined in (|13p , 

Mjs[{t) number of particles at time t. 

Rk number of particles killed at La between tk-\ and tk- 

s the process Zn is often studied between times u(log N) 3 and (u + s)(log iV) 3 . 

tk the process Zjy is frequently studied at the times tk- 

u the process Zn is often studied between times it(log iV) 3 and (u + s)(log iV) 3 . 

Vn normalization of the process Z/v, defined in (fTT|) . 

Xpf(t) the branching Brownian motion at time t. 

Xi t N(t) position of the ith particle from the right at time t. 

y large constant; the number of descendants of a particle at La that reach 

La — y plays a central role in the paper. 
ljv(t) weighted sum of particle positions at time t, defined in (fTOl) . such that a 

particle at x contributes e^ x to the sum. 
Zjv(i) measure of the "size" of the process at time t, defined in ([9]), such that a 

particle at x < L contributes e^ x s'm^nx/L). 
Zna similar to Zn, but with particles killed at La, defined in (|36p . 
Z' N1 similar to Zn,i, with La used in place of L in the sine function; see (137)) . 
Z y number of descendants of a particle at zero that reach —y. 

5 small constant used to bound the error in an estimate of a branching process 
limit; see ([96j) . 

e small constant used to bound Zfq above by £ -1 / 2 iV(logiV) 2 . 

fj, drift of the branching Brownian motion, given by (H|). 

r] small constant used to bound the difference between Z y and its limit. 

6 small constant such that tk and tk+i are #s(logiV) 3 apart. 

C large constant chosen so that with high probability, descendants of a particle 

at zero will have reached — y by time £. 



Some constraints on the constants e, A, and 9 are introduced at the beginning of Section 
3; see equations (f32l) - ([35l) . Further constraints on these constants, as well as the choices of the 
constants S, rj, y, and (, are set out in Subsection 5.1; see equations ([95|) - (|106p . 

Throughout the rest of the paper, C will denote a positive finite constant whose value may 
change from line to line. The constant C may depend on u and s, but may not depend on 
N or on the seven constants e, A, 9, 5, r], y, and £• We say a sequence of random variables 
(Rn)n=i 1S °(f{N)) if for any choices of the constants u, s, e, A, 9, 5, rj, y, and ( satisfying the 
constraints mentioned above, there is a deterministic sequence (bzv)jv = i tending to zero such that 
\Rn\ < bwf(N) for all N. Note in particular that throughout this paper, the bounds implicit in 
the notation o(l) or o(f(N)) are nonrandom and depend solely on the choices of parameters. 

Also, if g is a function of some of the constants e, A, 9, 6, rj, y, £, and N, we will occasionally 
use the notation 0(g(e, A, 9, 8, rj, y, (, N)) to denote an expression whose absolute value is bounded 
by Cg(e, A, 9, 5, r/, y, (, N), where C is defined as above. 
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2 Branching Brownian motion in a strip 



Suppose (B t )t>o is Brownian motion started at x, with < x < K, and assume the process is 
killed when it hits or K. Then (see, for example, p. 188 of [50J) the density of the process at 
time t, restricted to (Q,K), is 

*(x,y) = ^ 2t/K2 («x/K,«y/K) = 1 fj e -^»W <*» (ir) <** ) ■ (") 

Consider now branching Brownian motion in a strip in which each particle gives birth at rate 
one, drifts to the left at rate (i > 0, and is killed upon reaching or K. We will need to estimate 
the expected number of particles at time t when t is large. Suppose there is initially a single 
particle at x. The density of particles at the position y at time t can be calculated using the 
well-known many-to-one lemma. The density is a product of e , which represents the expected 
number of particles at time t, a Girsanov factor e^ 1 ^^^^ */ 2 relating Brownian motion with 
drift —ji to ordinary Brownian motion, and the density of ordinary Brownian motion killed upon 
reaching or K. Therefore, the density of particles at time t is 

, x (i-,L 2 /2)t+u(x-y) 2 -n 2 nH/2K 2 ■ ( nixx\ . ( niry\ 
q t (x,y) = e K " ' > ^ y > ■ — ^ e ' sm I — J sm I — J , (15) 

n=l \ / \ / 

in the sense that if B C (0, K), then the expected number of particles in B at time t is 
J B 9t(x,y) dy. 

When t ^> K 2 , the first term in the sum in (I15p dominates. We make this more precise in 
Lemma [5] below. We first record the following trigonometric lemma. 

Lemma 4. If < y < it and n£N, then \ sinny| < nsiny. 

Proof. We prove the result by induction. The result is trivial for n = 1. If it is true for n — 1, 
then 

sin((n — l)y) cos y + cos((n — l)y) sin y\ 
sin((n — l)y)|| cosy| + | cos((n — l)y)\\ siny| 
sin((n — l)y)| + | siny| < nsiny, 

where the last step uses the induction hypothesis. □ 

By applying Lemma U] to each term in the sum on the right-hand side of (|15p . we easily get the 
following estimate. Note that pt(x, y) is simply the n = 1 term in the expression for qt(x, y). The 
error term Dt(x,y) is small when t S> K 2 and is bounded above by a constant when t > C\K 2 
for some constant C\. 

Lemma 5. Consider branching Brownian motion in a strip in which each particle gives birth at 
rate one, drifts to the left at rate n, and is killed upon reaching or K. Suppose there is initially 
a single particle at x. Let 

Pt(x, y) = l^-V/^/^ . ^ sm ^ . e -,v sin . 



sin ny\ = 

< 
< 
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Then for all x, y G [0, K], define D t (x, y) by 

Qt{x,y) 
Pt(x,y) 

Then 



1 + D t (x,y). 



\D t ( X ,y)\ < ^=^ H/2K2 • (16) 

Therefore, if B is a Borel subset of (0,K), then the expected number of particles in B at time t 
may be written as (f B pt(x,y) dy)(l + D' t (x,B)), where \D' t (x,B)\ is bounded by the right-hand 
side of [To)) . 



Using these densities, we can estimate the expected values of certain functions of branching 
Brownian motion. Lemma[6l which is Lemma 2 of [39], gives a martingale for branching Brownian 
motion in which particles are killed only at zero. Lemma [7] estimates the expected values of three 
specific functions of branching Brownian motion in a strip. 

Lemma 6. Consider branching Brownian motion in which each particle gives birth at rate one, 
drifts to the left at rate [i, and is killed upon reaching 0. Let M(t) be the number of particles at 
time t, and denote the positions of the particles at time t by X\(t), . . . ,X M ^(t). Let 

M{t) 

v(t) = x i (t) e ^w + ^ 2 / 2 - 1 )*. 

1=1 

Then (V(t),t > 0) is a martingale. 

Lemma 7. Consider branching Brownian motion in a strip in which each particle gives birth at 
rate one, drifts to the left at rate \x, and is killed upon reaching or K . Let M{t) be the number 
of particles at time t, and denote the positions of the particles at time t by X\{t), . . . ,X M ^{t). 
Let 

M(t) M(t) /^Yf+W 

Y{t) = Y j e^ t \ Z{t)=Y j e^ {t) ^i^p-\. 

i=l i=l ^ ' 

E[M{t)\ = l_ e ^l^l2K-)t (1 + Di)m J« e - m gin W\ dy 



Then 



(17) 



and 



E[Y(t)} = ^-» 2 /2-« 2 /2K 2 )t(i + D 2 )Z(0), (18) 

7T 

where \D\\ and \D%\ are bounded by the right-hand side of t!6\) . Also, 

E[Z(t)} = e^-^^^^ZiO). (19) 
Proof. To prove (|17p . first suppose there is initially a single particle at x. Lemma [5] gives 

E[M(t)} = (J K Pt (x, y) dy^j (1 + D x ) 
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where \D\\ is bounded by the right-hand side of (|16p . The result now follows by summing over 
the particles at time zero. 

Likewise, to prove (I18p . assume there is initially a single particle at x and observe that Lemma 
[5] gives 

E[Y{t)] = (J e™p t (x, y) dy^j (1 + D 2 ), 

where \D%\ is bounded by the right-hand side of (|16p . Using 

f K . firy\ 2K 

we get 

E[Y (t)] = l e (i-M a /2-*72* a )* e ^ sin ( ™\i + D 2 ). 

IT \ K J 

The result again follows by summing over the particles at time zero. 
To obtain (fT9|) . note that if n is a positive integer, then 

K ■ firy\ ■ f™y\ , f K/2 ifra = l 
sin — — sin — — \ ay — ' 



o 



K V K J * 10 ifn>2 



If at time zero there is just a single particle at x, then 
E[Z(t)) = eW sin q t (x, y) dy 

n=l " " 







sm 



- 



J \K J' 

As before, the result now follows by summing over the particles at time zero. □ 

For the next result, we will need the Green's function for Brownian motion in a strip. Let 
(Bt,t > 0) be one-dimensional Brownian motion without drift. Define the Green's function 
G(x,y) such that if (Bt,t > 0) is Brownian motion started from Bq = x G (0,K) and if r = 
inf{i : Bt ^ (0,K)}, then for all bounded measurable functions g, we have 



E 



g(B t ) dt 



K 



G(x,y)g(y) dy. 



The Green's function is given by (see, for example, (4.4) on p. 225 of |28j) 

G(x v) _I 2x{K-y)/K ify>* 

G ^y>-\ 2y(K-x)/K ify<x. (20) 

To obtain this result from (4.4) in [28], observe that in the notation of [28], we have ip(x) = x 
and m(x) = 1 for ordinary Brownian motion. If y < x, then 2y(K — x)/K < 2x(K — y)/K. 
Therefore, for all x,y G [0,-RT], 

G(x,y) <2x(K-y)/K. (21) 

To control the fluctuations, we will also need a result about second moments. The following 
result, which is a slight extension of Lemma 3.1 of |43| . will be a useful tool. 
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Lemma 8. Consider branching Brownian motion with particles killed at both and K . Assume 
that at time zero there is just a single particle at x, and that the particles at time t are denoted 
by X\(t), . . . , Xj^ft\(t). Let f : (0,K) — > [0, oo) be a measurable function. Then 



E 



M C) \ 2-, r K r t r K / rK \ 2 

i{ x , z )\ I f(y)qt-s(z,y) dy) dzds. 



(£/(*(t))) f(y) 2 q t (x,y)dy + 2^ jf 



Proof. For a Borel set j4 C (0, K), let N A (t) be the number of particles in the set A at time t. 
Equation (2.8) of [63] gives 

E[N A (t)} = f q t (x,y)dy, (22) 
J A 

while equations (2.11) and (2.12) of [63J give 

E[N A (t)N B (t)] = E[N AnB (t)] + 2 [ [ q s (x,z)( f q t -s(z,w)dw]( f q t - s (z,y) dy] dz ds. 



JO \JA / \JB 



(23) 

Suppose / is a simple function, so that 



(M(t) -2-i mm 

J— 1 / -I »•— 1 „•— 1 



i=l 

where the are disjoint Borel subsets of (0, K) and the dj are positive real numbers. In this 
case, we have 

i: 

i=l y J i=l j=l 

It is now straightforward to check, using (|22p and ()23|) . that the conclusion of Lemma [8] holds 
in this case. Since every nonnegative measurable function can be approximated from below by 
simple functions, the general result then follows from the Monotone Convergence Theorem. □ 

Lemma 9. Assume we are in the setting of Lemma [?| Assume that at time zero there is just 
a single particle at x. Suppose that 1 — /u 2 /2 — ir 2 /2K 2 < 0. Also, assume there exist positive 
constants C\ and C2 such that C\K 2 < t < 6*2/(1 — /^ 2 /2). Then there exists a constant C, 
depending on jj,, C\, and C2, but not on x or K , such that 



E[Z(tf]<CeV K (^ + ±y 



Proof. We apply Lemma [8] with f{y) = e^ y sm(Try/K) to get 

rK /„„.\2 



E[Z{tf} = J e 2 M S m(^j q t (x,y)dy 

+ 2 J o J Q qs ^ z ^{J ^ V ^ n {^\t-s{z,y)dy\ dzds. (24) 
We begin by bounding the first term in (|24p . By Lemma for all x,y £ [0,K] we have 

qt(x, y) < sin fe) sin fa£\ , (25) 
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where we are using that 1 — /x 2 /2 — 7r 2 jlK 2 < 0. The assumption t > C\K 2 ensures that the 
error term from Lemma [5] can be bounded by a constant (throughout the proof, we allow the 
value of C to change from line to line) . Note that 

K sin (^]dy= f sin ( ^lA\ dy 



o 



= e» K J* e-™ S m(jfj dy 



Here we are using that \i > and that C may depend on \i. Now using (|25j) and (|26j) and the 
bound sin(7ry/ET) 2 < 1, we get 

^ e 2 ^ sin ( ^r) \ t {x, y)dy<g [* e ^e^ sin ( ^) sin ( ^) dy 



K J K J \K J \K J 

S — /„ e»sm^j rfy <^^. (27) 

It remains to bound the second term in (|24|) . Recall that y), defined in (|14p . denotes the 
density at time t of Brownian motion started at x and killed when it reaches or K. Note that 



v s (x,y) ds = G(x,y), 

where G(x,y) is the Green's function in (|20p . Since t < Ci/(1 — /^ 2 /2), we also have for s <t, 

q s (x, y) = e^-yW-» 2 / 2 K s (x, y) < Ce^-^v s (x, y). (28) 
Since t > C±K 2 , the bound ([25]) is valid for q t - s (x, y) when s < t/2. Using these results and (f2Tj) . 

J e m sin qt-s(z, y) dy\ dz ds 

< J* Ce^ x - Z ^v s {x, z) i J* sin ■ j^ef**-v) sin sin dy) dz ds 



t/2 r K / j-K 

aJx,z) 



(~i p lix ft/2 fK 



K 2 



Jj j o e^ s (x,z)sin(f ) Q sin(^) dy) dz ds 



< CeT I e^ z sin fe^ ( J* v s (x, z) ds) dz 
<Ce^j\^sin^ 22x{K K Z) dz 

<Ce^f\^J£gl dz <^, (29) 

where for the third inequality we used that sin(7ry/ET) 2 < 1 and for the next-to-last inequality 
we used that sm(irz/K) = sm(ir(K — z)/K) < (K — z)/K and x/K < 1. 
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Next, let v' t (x, y) be the density at time t of Brownian motion started at x and killed when it 
hits 0. By the Reflection Principle, for s <t, 



K 



yv' s (x,y) dy 



1 



Ins 
1 



K 



K-x 



< 



Itts J- 
1 



'2-KS 



(z + x)e~ z2/2s dz 



ze -* 2 l^dz+ lr 



/ 2tts 



K+x 



(z - x)e~ z2 ' 2s dz 



'2tts 



e - z2 ' 2s dz = 2x. 



Therefore, using that v s (x,y) = v s (K — x,K — y) < v' s (K — x, K — y), 



j Qs(x,z)(^J sin (^ S jq t - S {z,y)dy^ dz ds 



< 



t ,-K 



c 



e^ x - z) sin 



< 



< 



t/2 JO 
Ce »x 

K 



K 

t pK 



Tr.r 



sin 



t/2 JO 

t r K 



K , 

7TZ \ < <- K 

K 



nz\ < <- K 
K 



s"t*-») sin (^pj v t - s (z, y) dy^j dz ds 



o 



Try 



sin I — )v t - 8 (z,y) dy ) dz ds 



K 



t/2 JO 



TTZ 

~K 



< 



< 



Ce^ x f l f K 

Cel MX 



K 



it 2 So eM2Sill (?)(i) VVlt - S ^ K ~ Z ' ^ ^ 



K-y 
K 



v t - s (z,y)dy) dz ds 



K 3 



< 



't/2 

Ce^ x t f K 



K 4 



[ e» z (K- zf dz< 
Jo 



Cef MX e u,K t 



K 4 



The result follows from (12711 . (129 1) . and (jM 



(30) 
□ 



3 Particles hitting the right-boundary 

Recall that we are considering (Xn (t),t > 0) , which is a branching Brownian motion with drift 
—fj, and killing at the origin. Recall also that Proposition Q] involves the processes (Z^(t),t > 0), 
where Zpj(t) is a weighted sum of the positions of the particles at time t. Throughout this entire 
section, as well as sections 5, 6, and 7, we assume that the hypotheses of Proposition Q] hold. 



3.1 The particles that never reach La 

To prove Proposition [U we will need to consider these processes at two times u and u + s, where 
< u < u + s. Fix a small number 6 > such that 6~ l 6 N. For < k < (9 _1 , define the time 
tk = (u + 6 ks) (log N) 3 . We will be interested in the value of the process Zn at the times The 
assumption that 0" 1 G N is useful for defining the sequence {tk}o<k<e- 1 - However, many of our 
results pertain to the state of the process at time t^, conditional on the state of the process up 
to time For these results, the assumption G N is not necessary. 
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Since Yn(0)/N (log iV) 3 converges in probability to zero, there exists a nonrandom function h : 
N -> (0,oo) such that h(N) -> and (log N)h(N) -> oo as TV -> oo, and Yjv(0)/(iV(log N) 3 h(N)) 
converges in probability to zero. (This is a simple consequence of the following fact: if Xn —> 
in probability, then there exists a nonrandom sequence such that — > Q as N — > oo and 
P(X N > h N ) -> 0.) Let e > 0. For < k < 9' 1 , let Gjv,* be the event that for j = 0, 1, . . . , k, 
the following two events occur: 

• We have Z N (tj) < e~ 1 / 2 iV(log N) 2 . 

• We have Y N {tj) < N(log Nfh(N). 

Finally, let Gw(e) = G N g-i. Let (F t ,t > 0) be the natural filtration of (Xw(t),t > 0). This 
filtration, of course, depends on N, but we suppress this dependence in the notation. We will 
need to consider the conditional distribution of Z^(tk) given J : t k _ 1 - Note that the event GV,fc-i 
is in T th _ x . 

In this section, we will consider the particles that would still be alive if, between times t^-i 
and tfc, we killed particles that hit La, where La was defined in (|13|) . Recall that both La and 
the drift fi depend on N. We will always assume that N is large enough that La > and 

which is possible because, by (HJ), the right-hand side tends to e~ A as N — > oo. Because Y)v(ifc) < 
A^(log N) 3 h(N) on Gjv.fej this ensures that on GV,fc> all particles at time tk are to the left of La, 
a fact which will be invoked repeatedly in what follows. 

Note that we have defined three constants: e, A, and 9. We think of e as being small. 
Typically A will be a large positive constant, but we will also at times consider negative values 
of A. Finally, 8 will always be a small positive constant. In particular, we will assume 

9 < 1 (32) 

\A\9 < 1 (33) 

A^AOse' 1 ' 2 < e- A ' A (34) 

de A e' 1/2 < 1. (35) 

These assumptions will be in force through the rest of this section, except in Proposition [23] 
below, where it will be convenient to allow 6 to be any number with 6" 1 £ N. A stronger set of 
restrictions on 9 will then be introduced at the beginning of Section [5) 

For t G [tk-i,tk], we say i G Sit) if for all v G [£fc— i>i] 5 the particle at time v that is the 
ancestor of X^wit) is in (0,La)- Consequently, for tk-i < t < t/-, the positions of the particles 
in S(t) follow a branching Brownian motion with drift — /i, with particles killed when they reach 
or La- Define 

M N (t k ) . . 

Z N , 1 {t k )= E e^^)sin(^M)l {je5(tfc)} , (36) 



and for t G [£fc_i,ifc] define 



{ieS(t)}- (37) 
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Although our interest is in Zjv,i(tfc), we will need to approximate this random variable by Z' N i(t k ), 
which is defined in the same way except with La in place of L. The next result shows that the 
difference between these quantities is small. 



Lemma 10. On G Njk -l, both \Z' Ntl (t k -t) - Z N (t k -{)\ and E[\Z' NA (t k ) - Z N] i(t k )\\ T tk _^ 
o{N{\ogN) 2 ). 



are 



Proof. If a > 0, then 



Therefore, if < x < La, then 









a /a\ 




K x) 




cos - 








X A \x J 



a 



sin 



7TX 

~L 



8111 



TTX 

La 



\L — La\ttx 

< — 7T < 



tt\A\L a 



m.in{LA,L} 2 s/2 min-fL^, L} 2 



On GjVifc-i) & h particles at time t k -\ are to the left of both La and L for sufficiently large N. 
The indicators are therefore not needed in (|9|) and ([37|) when t = t k -\, and we get 



l^,l(*fe-i) ~~ Z N{t k -l)\ < 



Mjv(t fe _i) 
V2mm{L Al L} 2 ^ 



7T\A\L A 



■K^LAYNjtk^) 

y/2mm{L A ,L} 2 



(38) 



which is o(N(logN) 2 ) on G^,k-i- Applying the same reasoning at time t k to the particles in 
S(t k ), we get 

F\\7> <t\ 7 (t\\\T 1 < ^l^l^flivfa)!^,] 

V2 min{L^, L} z 



Note that 



7T 



71" 



2tt 2 ^ 

2 2L^ (logiV + 31oglogiV) 2 (log iV + 3 log log N - A) 2 {\ogNf 



;i+o(l)). (39) 



Since i fc - i fc _i = (logiV) 3 6>,s and ([33]) holds, equations dTSJ and fl39|) give #[Yjv(i fc )| J" tfc _J < 
CZ N (t k _ x )(A + o(l)). It follows that 



E[\Z' N ^t k )-Z Ntl {t k )\\T tk _ r ] < 



C|A|L A Z Jv fa_ 1 )(l + o(l)) 
minjLyi, L} 2 



which is o(A r (log N) 2 ) on Gjf,k-1- 

We now estimate the conditional mean and variance of Zpf t i{t k ) given J r t k _ 1 - 
Lemma 11. On Gjv,fc-i> we fraue 

£[^,1(^)1^-1] = Zjv(i fc -i)(l - 2tt 2 AOs + 0(A 2 # 2 )) + o(iV(log AO 2 ) 
JTie same bound holds with E[Z' N i(t k )\ Tt h ,] on the left-hand side. 



(40) 
□ 
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Proof. By (|19p and the Markov property of branching Brownian motion with particles killed at 
and La, we have for sufficiently large N on Gjv^-i, 

E{Z' N +{t k )\Ft k -,] = e^l^l^^-^Z'x^-i), (41) 

using the fact that for sufficiently large N, on Gjvfe-i all particles at time tk-i are to the left of 
La- Since tk — ifc-i = (log N) 3 8s, it follows from ([39]) that 

e (l-M a /2-^/2^)(**-* k -l) = e -27r^(l +0 (l)) = 1 _ 27r 2 A0s + (^ 2) + ^ ^ 

where the assumption f|33|) ensures that the error term is O{A 2 2 ). The result now follows from 
equations (|41|) and (|42p together with the two bounds in Lemma [lOj □ 

Lemma 12. Assume A > 0. On Gj^k—l> we have 

Var(Z^ 1 (t fc )|^ fc _ 1 ) < CeJV(logJV) 3 e- A (^(t fc _i) + o(iV(log iV) 2 )). 

Proof. For i e , i^] , define Z' N1 (t) as in (|57|) . and define 

M N (t) 

YN(t)= E 

i=l 

Define i^-l = sq < si < ■ ■ ■ < sm = ifc so that for some positive constants C\ and C2, we have 
Ci(logiV) 2 < s n — s n -i < C^logiV) 2 for all n. Recall that for any random variable X and any 
a- fields T and Q with J 7 G Q, we have 

Var(X|jr) = E[Yai(X\g)\T] + Vai(E[X\g]\F). 

Therefore, for 1 < n < M, we have 

V a r(Z' NA ( Sn )\T S0 ) = ^[Var^^^lJ-^JI^] + Var^^i^l^.JI^). 

Equation {TSJ implies that J5[Zj srjl (s„)|J r jB _ 1 ] = e^-^-^iiXsn-^-i)^^^). Because 
A > and thus 1 - fi 2 /2 - n 2 /2L 2 A < 0, it follows that 

Var^Z^^lJ^JI^) < Va,r(Z' N)1 (s n ^)\T S0 ). 

Therefore, 

Var(Z^ 1 (s n )|J r S0 ) < ^[Var(Z^ 1 (s n )|J c " Sn _ 1 )|J c " S0 ] + Var(Z^ Vjl (s n _i)|J : " S0 ). 
Now Vax(Z' N 1 (so)\J r So ) = 0, so by induction, 

M 

Var(^ 1 ( SM )|J- S0 ) < J^lVar^iWI^)!^]. (43) 

n=l 

Because the particles at time s n _i evolve independently between times s n _i and s n , the 
conditional variance Var(Z^ 1 (s n )| J r Sn _ 1 ) is the sum of the conditional variances of the con- 
tributions to Z' N i(s n ) from the individual particles at time s n _i. We will use the inequality 
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\ax(X\F) < E[X 2 \T] and apply Lemma [9] with K = La and t = s n — s n _i. The hypotheses are 
satisfied because 1 — /i 2 /2 — it 2 /2L 2 A < 0, and both s n — s n _i and 1/(1 — /i 2 /2) are of the order 
(logiV) 2 . Therefore, 

1 . s n S n —\ 



Var(Z^ 1 ( S „)|J- Sn _ 1 ) < Ce^Y^i)^ + 
Now e^ LA < N(logNfe~ A , so 



Var(Z^ 1 ( Sn )|^_ 1 ) < CJVOogJV)^-^^!)^ + ^f^)- (44) 
From f)18[) . we get 

max . MY^( Sn _a)| < CZ' N>1 (s )(l + o(l)) = CZ' NA (t k ^)(l + (1)). (45) 

2<n<M 

Finally, note that M < C0(logJV). Combining this with (@3j), (jUJ), and ([H]) gives that on G N , k -i, 
Var(Z^ 1 (t fe )|J r ltfc _ 1 ) = Var(Z^ 1 (s M )|-7 : ' S o) 

3 . / 1 , (logiV) 2 



< Ci\T(logiV) e TT + 



x (y^(ao) + C0(logiV)^ 1 (t fe _ 1 )(l + o(l))) 
< CeN(log Nfe- A + ^.iftk-i)) (1 + °(!))- (46) 

The result now follows from Lemma[lO]and the fact that Y^(t k -i) < Y N (t k -i) < N(logN) 3 h(N) 
on Gjv,fc-i- □ 

Corollary 13. Assume A > 0. On (^Nk—l? we have 

P{\Z NA (t k ) - Z N {t k ^)\ > Ae~ A l 4 N{\ogN) 2 \ F tk _,) < COe^e- 1 ' 2 ^ + o(l)). 
Proof. By the conditional form of Chebyshev's Inequality and Lemma fl~2| on Gjv,fe-l we have 

P(|^ fl (t fc ) -^kl^l^-JI > e-^GogiV) 2 !^) < 

< C9e- A ^£-y 2 {l + o{l)) (47) 

because Zjvfo-i) < e~ 1/2 iV(log A^) 2 on G Nk _i. Using d^SJ), some calculus, and the assumption 
that A > 0, we get that for N large enough that A < 3 log log iV, 



; (i- M 2 /2 _ 7r 2 /2L 2 i)(t; ,_ t)b _ i) _ , 



< 



vr 2 



2 2L\ 



6s(\ogNf < 2tt 2 AOs. 



Therefore, by (JUJ), if A < 3 log log iV then 

^[Z^^I^.J-Z^^OI < 2ir 2 A6sZ' NA (t k _ 1 ) 
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Because Z' NA [t h -x) = Z N (t k ^) + o(N (log N) 2 ) < e' 1 ' 2 N (log N) 2 + o(N (log N) 2 ) on GW,k-i by 
Lemma [TO] and 2tt 2 A6se- 1/2 < e~ A/A /2 by ([HI), it follows that for sufficiently large N, 

\E[Z' N ^(t k )\T tk ^} - Z' NA (t k ^)\ < e- A / A N(logN) 2 (48) 

on Gjv,fe-i. By Lemma [T0| on GN,k-i we have 

\Z' N>1 (t k ^) - Z N (t k ^)\ < e- A / 4 N(logN) 2 (49) 

for sufficiently large N and 

P(\Z N:1 (t k ) - Z' N>1 (t k )\ > e-^N^ogNflTt^) -> (50) 

uniformly as N — > oo on Gjsr >k -%. The result follows immediately from (|47p . (|48p. (|49|) . and 

dSOD- □ 

Proposition 14. Suppose A = 0. Lei 

Mjv(ifc) 
t=l 

6e the number of particles at time t k whose ancestor at time t is in (0,L) for all t G [t k -i,t k ]. 
On Gtfk-i, there exists a constant C such that 

Var(M^(t fe )|J' tfc _ 1 ) < C0e-V 2 N 2 (1 + o(l)). 

Proof. As in the proof of Lemma [12} the conditional variance can be bounded by the sum of the 
variances of the contributions to M' N (t k ) from the individual particles at time t k _\. The variance 
of the contribution from a particle at x can be bounded by the expected square of the number 
of descendants of this particle at time t k . This expectation is given by Lemma [8] with f(x) = 1 
for all x and t k — t k -\ in place of t. Therefore, 



" pLi 

Var(M^(t fc )|J r tfc _ 1 ) = V / q tk _ tk _ 1 (X hN (t k - l ),y)dy 

i=i J o 

+ 2 ^ y J %_ ifc _ 1 (X iiA r(t fc _ 1 ),z)^y q tk - t (z,y) dyj dzdt. 



The first term is S[Mj v -(t fc )|J : t fe _ 1 ], which by {I7J) with if = L is at most CZ N (t k -i)(l + o(l))/L 2 
because the integral on the right hand side of (|17p is of the order 1/K. This expression is o(N 2 ) 
on Gn,u-i- 

The argument to bound the second term is similar to the proof of Lemma [9] but requires 
splitting the outer integral into four pieces. First consider the piece between t&_i and + 
(log iV) 2 . lit < (logiV) 2 , then {25) holds and 

v t (x, y) ds = G(x, y) < 2x{L ~ v) < 2(L - y) (51) 
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by (EH). Since 1 - fi 2 /2 - tt 2 /2L 2 = 0, Lemma! gives that on G^k-i, 

I qt-tk-^Xi^fa-i)^)^ q tk - t (z,y) dy^j dz dt 

' TTZ\ _ 



A%(tfe-i) .^.^(log/V) 2 j-L 



M N {t k -\) f (\ogN) 2 r L 



i=i 



/o 



q t (X i>N {t k -i),z) 



L 



< 1 I — 1 1 ' sin 



dy ) dt 



-Jl z2 J J Qt(X ltN (t k . 1 ),z)e 2 ' Iz sm^—j dz dt 

c A%(t fc _i) 

- 1* E 



e 2 ^ 2 sin 



i=i " u 

Afiv(t fc _i) / , L 



/ \ 2 
/ 7TZ 



< 



q.t{Xi,N(tk-l),z) dtj dz 
^iMt k -i)-^) n\^ Xi>N (t k ^),z) dt\ dz 
^ e^^^-Oj y e^sin^J (L - z) dz 



i=i 



e ^ sin I — I ( ' 



C , . 



< ^ • iV(t fe -i) • -£2- < CiV 2 d(iV). 



(52) 



We next consider the case t k —\ + (logiV) 2 <t<t k — (log A r ) 2 , and from Lemma[5]we get that 
on Gjv,fc-i, 

Afjv(tfe-l) />t fc -(log7V) 2 /■£ / /•£ \2 

yZ / %-t fe _i(^,iv(ijfc-i),2)( / qt k -t{z.,y) dy ) dzdt 



i=i 



/tfc.x+aogw) 2 Jo 



e r sin 



L J 



-e? z sin ( — ) e-™ sin ( -f- ) ) dt 



< 



Cfa — tk-l)ZN{tk-l, 

L 3 







i: 






L 






sin ( 









7TZ 



L J 

Try 



e -w S in ( -j- ) dy 



< ^a»g^) 3 ^fa-i) ^ i cee _ l/2N2 _ 

L 6 L 6 L z 



(53) 



Consider now the case t k - (log iV) 2 < t < t k - (log iV) 7/4 . Note that if t < C(logiV) 2 , then 
e (i-/^ 2 /2)t < SQ by (TJ5J) and Lemma HI 

qt(x,y) < jeH*mn (f) e ^ sin (t) fX^' 7 * 
Breaking up the sum into blocks of size M = \L/yt\ gives 

2> 2 e-W 2 < ^ M (M(£ + l)) 2 e- 7r2 ( A/ ") 2 '/ i2 < M^{t + ife^ < C L ' 



n=l 



£=0 



£=0 



t 3/2 
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Therefore, 



Miv(tfe-i) „ tfe _(l g7V) 7 / 4 
^ it fc -(logAf) 2 Jo 



qt-t k ^ 1 ( X hN(tk-i),z)( / q tk -t(z,y) dy) dz dt 



^ E 

i=l 



(log TV) 7 / 4 JO 



L 



\L ) 



L L 2 



TTZ 



/ /-(log AT) 2 i \ / /• 

<CL 3 ^(t fc _!) / -j* 

Vi(log7V) 7 /4 t J / V JO 



^rrre^ sin — e" ra sin dy dz dt 



e M2 sin ( — | dz 



Try 



< CL 3 ZN(tk-i) ■ 



L J 

1 e^ L 1 Ce'^iV 2 



L 



e ^ y sin ( — ) dy 



■ — < 



(log N) 7 1 2 L 3 L 2 ~ (logN) 1 / 2 ' 



(54) 



Next, consider the case t k - (log iV) 7 / 4 < t < t k - 1. Using PSJ, and the obvious fact that the 
density ^(x, y) of Brownian motion killed at and L is dominated by the transition probabilities 
of standard Brownian motion, for t < (logN) 2 we have 



< 



tV2 



(55) 



We split the integral over z into two pieces and obtain 

r2L/3 
lt h -(logN)V 4 JO 



Mjv(t fe _i) t i 

E 

i=l 



L \ 2 

gt-t^^jv^jfe-i)^)! / q tk -t{z,y) dy) dz dt 



o 



< 



< 



< 



L 

Z 

CZ N (t k _i) 
L 



(log TV) 774 /-2L/3 
JO 

(log7V) 7/4 /-2L/3 

Vo 

(log AO 7 / 4 j 



q t {z,y) dy dz dt 



ttz\ ( f L e^e'M \ 2 , , 
dy dz dt 



L 7 



o 



tV2 



t 



dt 



2L/3 M2 - (™ , • 
e M sin — 



o 



e-^ dy 



< CZjV ^- l} • log log iV • e 2 ^ 3 = o(7V 2 ) 



(56) 
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and 

Mjv(tfc-i) tk -l ,-L / ,-L \ 2 

^2 / Qt-t k _ 1 ( X i,N(tk-i),z)[ / q tk -t(z,y) dy) dz dt 

trt Jt k -Qo S N)i/*J2L/3 \Jo J 



< cz «fa-'» /• (I ° 8A,) " 4 t L e ,j t Li \-„ e - { -,- yfniiyJr t L e -„ dy \ 2 dzdt 

L J l J2L/3 \Jo JL/3 J 

< CZN ^ k - 1 \ \ og N) 7 '^ L ( e -^ e -a°g^) 1/4 /36 dy + e -^/a^ 2 

< C7V(IogiV) 1B / 4 Z JV (t fc _ 1 )(e-( to s JV ) 1/4 / 36 + e-^ 3 ) 2 = o(N 2 ). (57) 



Finally, if < t < 1, then f Q q t (z, y) dy < e* < C, so 

z2 Qt-t k ^( x i,N{tk-i),z)i / q tk - t (z,y) dyj dz dt 

(¥)(/•*•>*)'** 

< CZjV ^- l} £ e"^ sin < = o( iV 2 ). (58) 

The result now follows from (|52 |> . (|55 |) . <|53|> . (|5B |> . I|57 |> . and <|55 |l . □ 
3.2 The number of particles that hit La 

For k G N, let denote the number of times t between tk~\ and that a particle reaches 
La at time t and, for all it £ i), the ancestor of this particle at time u was in (0, La)- 

Equivalently, Rk is the number of particles that are killed by hitting La between times t^-i 
and tk- Note that particles can reach La before time tk-i and still contribute to Rk- Below we 
calculate the conditional mean and second moment of Rk given J : t k _ 1 - 

Lemma 15. Suppose there is a single particle at x at time zero, where x G (0,La)- Suppose 
particles undergo branching Brownian motion with drift —fj, and are killed when they reach or 
La- Let R be the number of particles that hit La between times t and t + k, where < k < 1. 
Then 

E[R] = 2ne A K e^ 2 /^ 2 /^ ■ e^ sin (^) (1 + + ^ , (59) 

where \D\ is bounded by the right-hand side of M6\) with La in place of K , and o{k) is a term 
whose absolute value is bounded by g(n) for some bounded function g : (0, 1) — > (0, oo) with 
lim^oS^) = 0. 

Proof. Let (Bt,t > 0) be standard Brownian motion started at the origin. Suppose that (for the 
branching Brownian motion), there is a particle at y at time t, and let < k < 1. The expected 
number of descendants of the particle at time t + k is e K , and the drift of —\x can only reduce the 
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probability that a Brownian particle reaches La- Therefore, an upper bound for the expected 
number of descendants that reach La at by time t + k is 

J 9 r°° 

e K P( max B t > L A - y) = 2e K P(B K >L A -y) = e K \ — e^ 2 * dz, (60) 

0<t<K V KIT J L A —y 

where the first equality follows from the reflection principle. To get a lower bound, we may 
ignore the branching, and bound the probability that Brownian motion with drift — /i reaches La 
by time k without hitting the origin by the probability that ordinary Brownian motion reaches 
La + [J>k by time k without hitting [in. For y > fj,K, this leads to a lower bound of 

P( max Bt > La — y + /ti/«) — P( min £>£ < — y + /ik) 




2 

K7T 



OO /-00 

2/0,, . / _,2, 



e z / 2 *cte- / e-^dz). (61) 



From Lemma[5j the expected number of particles in the set B at time t is (f B pt(x, y) dy){l + 
D' t (x,B)). Now integrating over y and applying (|60|) . we get 



E[R]<(1 + D)J q A p t (x,y)-e K ^m™ e^^dz^dy 




'La-V 

2 2 .(l-^-^/^)*^,.!-. Z' 7 "^ 



(1 + J D)e K W 1^* I^APef* sm — 

K7T La \LaJ 



l A r e" w sin (^)e- z2 / 2K dzdy, (62) 
•/ JL A -y \LaJ 



where \D\ is bounded by the right-hand side of (|16p with in place of if. Interchanging the 
roles of y and La — y, then using Fubini's Theorem followed by the bound siny < y for y > 
gives 

cL^ roo / mi\ r, , r^A foo 



[ A I" e~ w sm ( ?-) e~ z2/2K dz dy = e~^ A f A [°° sin ( e^' 2 * dz dy 
Jo JL A ~ y \ l aJ Jo J y \ l aJ 

roo rmin{z,L A } / \ 

J J e^sinl^je^^dydz 
< e-^ J°° (f Q Y- A dy) e^e- z2 ' 2K dz 



■Kq-V-La poo 



2L A 



2L A 



z 2 e» z e- z /2k dz 



(63) 

The substitution y = zj^fk gives 

roo 

z 2 (e^ z - l)e- z2 / 2K dz = k 3 / 2 / (e™^ - l)y 2 e -^/ 2 dy, (64) 
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and the last integral goes to zero as n — > by the Dominated Convergence Theorem. Therefore, 
combining ([62]) . ([63j) . and we get 

m S (1 + - • '^'ef sin (-) ■ 23/2La (1 + <*.)). 

Since L A = (2- 1 / 2 log iV)(l + o(l)) and e"^ = e A (l + o(l))/(iV(log iV) 3 ), it follows that E[R] 
is bounded above by the right-hand side of (f59|) . 

We next establish the lower bound. Truncating the outer integral at La/2 and using (|6ip . we 
get, for some D whose absolute value is bounded by the right-hand side of (fl~6j) with La in place 
of K, 



E[R] >(1 + D) Pt (x, y)J^-( e~ z2l2K dz - H e^ 2 / 2K dz ) dy 

J L A /2 * KIT \J L A -y+[iK Jy-fiK 

TTX \ 



(i + D) PL . 2L e (i-fn-*y*Li)t e » x sin ( ^ 

V kit L A \La, 
x e -w s i n ^V r e~ z2 ^dz- [°° e~ z2 ^dz\dy 

Jl a /2 \LaJ \JljA-y+fJ.K Jy-fiK J 

(1 +D)J^- A^-^-rV^V* sin f ^ 

V kit L A \ l aJ 

i>La roc 

/ / e'M sin 

'0 JLA—y+HK 

fl-A / \ / /"OO 



[ LA e-M S m(^)( e- z2 / 2K dz)dy 

JLa/2 \La) \Jy-fj,K J 

/•La/2 rco 



-f A P e~^s\n(^-\e~ z2 l 2K dzdy\. (65) 
Jo JLA-y+n*. \^AJ ) 

To bound the second term in (f65l) . note that by substituting w = z/y/n and using the fact that 
Ix° e ~ W ^ d" w — x~ 1 e _x ' / 2 , we get 



e - z2 / 2K dz < 2 V^ c -(L A -2^) 2 /8«^ 



Therefore, 

f e -» si „ (st V f , - /* dz ) dy < / L * e -» sin C r dz ) dy 

JL A /2 \LAj\Jy-fiK J JLa/2 \La)\J±L a -Hk J 
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To bound the third term in ( I65D . note that 
Jo Jl a -% 



e-^siJf^je-" 2 ^ dzdy 



"La r 00 
< ' 1 / / ('" sin I — | 

' L A /2 Jy+fiK \La 
poo rmm{z—iiK,L,A} 



e 



roo rram{z-fiK,L A } / \ 

» La / / sin e" z ' 2k dy dz 

J L A /2 J L A /2 \La) 



< —r / z 2 e^ z e~ z ' 2K dz = — o(l). (67) 

L A Jl a /2 V L A J 

For the first term, we argue as in the proof of the upper bound, and then use that e^ y > 1 and 
siny > y — y 3 /6 for all y > to get 

La r°° / 7T7/ \ 9 , 

*y \ _- z *, 2k dz dy 



JLa—V+Lik 



La 



e 



roo fvam{z-HK,,L A } / \ 

^ y y e^ S m(^-)e- z2 / 2K dydz 



%e -»L A roo rmm{z-^,L A } ^-f^A f°° ^ , 

> — t / / ye 1 dy dz ^ — / y e z dy dz 

La Jo Jo 6Zft J J 



2L A 



■ / min{z - M k, L A } V* /2k - 6 / zV z / 2k 
Jo 24L A J Q 



The second integral is a constant times k 5//2 . The first integral would be k 3 ^ 2 ^/tt/2 if we had z 2 
in the integrand in place of min{z — fin, La} 2 - Also, 

roo roo 

/ (z 2 - mm{z - {j,k, L A } 2 )e~ z2/2,t dz < / max{2/iKz, z 2 l{ z > LA y}e~ z2 /2k dz. 
Jo Jo 

If we use 2[ikz in the integrand, the integral is bounded by Cn 2 . If we use z 2 l{ z >^ A }, the integral 
divided by k 3//2 tends to zero uniformly over k G (0,1) as N — > oo, so the integral is k 3//2 o(1). 
These observations, combined with (j65[) and the bounds in (166p and (|67[) . imply that is 
bounded below by the right-hand side of (foTij) . □ 

Proposition 16. We /wroe 



A ^.lfa-l)* 8 ,., , , , Ce A Y N {t k _ x ){l + o{l)) 



EiRklTt^] = 27re A • "'V ' (1 + O(|A|0) + o(l)) + 



iV(logA^) 2 v iV(logiV) 3 

S^I^J = 2^ • §7^^ (1 + O(W«0) + o(l). 

Proof. We first consider the particles that reach L A between times i^-i and + (logiV) 2 . 
Define Rk{t) in the same manner as R^, but counting only particles that reach La between times 
ijfc-i and t. Let Rki = Rf.(tk—i + (logiV) 2 ). We now consider the martingale from Lemma [6j 
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Since /U 2 /2 — 1 < 0, this process will still be a supermartingale if particles are stopped, but not 
killed, when reaching L A . More precisely, for tk-i < t < tk, let Xffj(t) = X^n^) if, for all 
u € [tk-\,t), the ancestor at time u of the individual Xi^(t) is in (0,La), and let X- £(t) = 
otherwise, and then for t k -\ < t < tk, define 



M N (t) 

^L A + ( M 2 /2-l)(t-t fe -i) , T L ^^V iX l iW + ^ 2 /2-l)(t-t fe -i) 



V A (t) = R k (t)L A e» L ^ /2-i)(*-* fc -i) + Y, X-Mty 



i=l 



Then (V A (t),tk-i < t < tk) is a supermartingale with respect to {J-t,tk-\ <t< t k )- Therefore, 



V A (t k -i) > E[V A {t*-i + (log AO z )|J- tfc _J 

> i?[ J R fc , 1 L A e^ + ^/ 2 - 1 )( lo ^)V tfc _ 1 ] 
= LAe ^+(^/2-i)(iogiV)^ [jRfcii |^_ i] . 

Note that since X^(t) < L A , we have V A (tk-i) < L A Yjsi(tk^\), which means 

Since (1 - fi 2 /2) < C/(logN) 2 and > JV(log A^) 3 e" A (l + o(l)), we have 

^M^-J < • (68) 

We next consider the particles that reach L A between times t^-i + (logiV) 2 and t k . The 
strategy will be to choose a small number 5, break the time interval [tk-i + (log N) 2 ,tk] into time 
intervals of length 5, and then use Lemma [15] to estimate the number of particles that reach L A 
in each of these intervals. We first make three remarks concerning the application of Lemma [151 
First, note that if the interval starts at time t, then t — tk—\ plays the role of t in Lemma [T5l 
Since t — t^-i < (log N) 3 9s, equation (|39j) implies that 

| e (i-^/2- 7 r 2 /2ii)(*-* fc - 1 ) _ !| < C \ A \ 6 + (69) 

Second, we need to consider all particles at time t rather than just a single particle at x, so in 
place of e^ x sm(7rx/L A ), we have the expression Z' N1 (tk-i) from (|37|) . Third, note that by (fT6j) 
with K = L A , the error term \D\ is bounded by C(l + o(l)) for £ > tk-i + (logiV) 2 , and |.D| is 
o(l) fort^t^x + OogiV) 5 / 2 . 

Let i?fc t 2 be defined in the same way as R, but counting only particles that reach L A between 
times t fc _i + (log iV) 2 and t k -i + (log TV) 5 / 2 . We can divide this time interval into at most 
5 _1 (log TV) 5 / 2 time intervals of length 6, so by Lemma PT5l 

S Wl^l ^ Ce ^ • AT(logiV)5 • 5— " (1 + " NQogN)*/* ' ( 7 °) 

Let Rk,3 be defined in the same way as R, but counting only particles that reach L A between 
times tk-i + (log iV) 5 / 2 and tk- This interval can be divided into <5 _1 (log N) 3 8s(l + o(l)) intervals 
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of length 5, so by Lemma [T5l and ([69]) . 



£[i4, 3 |^_J = 2vre A 5(l + 0{\A\9)) 



Z^ 1 (t fc _ 1 )(l + o(l))(l + (5)) (\ og N)Hs 



N (log Nf 



A Z' Nl (t k -i)0s 



= ^ iv'(logjV)^ (1 + OWN 1 + + °^ + °^ ^ 

The first statement of the proposition follows from (|68p . (|7U|) . and (|71|) by choosing 5 as a function 
of iV so that 5 — > as N — > oo. On Gjvfc-i, the second statement follows from Lemma [TOl □ 

The corollary below follows immediately from the above proof, because the number of particles 
that hit La between tk — (log iV) 5 / 2 and tk can be bounded in the same manner as E[Rk t 3\J r t k _ 1 ], 
and the number of intervals of length 5 is only (log N) 5 / 2 /5. 

Corollary 17. Define Rk the same way as Rk, except only counting particles that reach La 
between tk — (log iV) 5//2 and tk- Then E[Rk\J r t lc _ 1 ] is o(l) on Gjv,fe-i- 

Proposition 18. Assume A > 0. On Gj\rfc-ij we /iaue 

^|JVi]< j\r(log A^) 2 

Proof. For the purposes of this proof, we may assume that particles are killed upon reaching La- 
Note that R\ = Rk + 2Y, where Y is the number of distinct pairs of particles that get killed 
upon reaching La- We may further write Y = Y\ + Y2, where Y\ denotes the number of pairs 
of particles that get killed upon reaching La whose most recent common ancestor is before time 
tk-i and Y2 counts the other pairs of particles. Proposition [TBI and ([33]) give that on GN,k-i, 

*«.-^W 1+ ' ,1) ' (72) 

If there is a particle at x at time tk-i and a descendant of this particle reaches La by time tk, 
then the number of pairs in Y\ involving this descendant will be precisely the number of particles 
descended from particles other than the particle at x at time tk-i that reach La by time tk, which 
is bounded by Rk- Because descendants of different particles evolve independently, it follows that 

Efrfr^] < (ElRkl^f < c( ^ g ( ^ 2 l) ) 2 + o(l). (73) 

It remains to consider Y%. Because pairs of particles contributing to Y2 have the same ancestor 
at time tk-i, we may consider separately the contributions of the particles at time tfc-i. Assume 
for now that there is a single particle at x at time tk-\, and we denote the number of associated 
pairs of particles contributing to Y% by Yf. Let h(t,y) be the expected number of offspring of a 
single particle that is at y at time tj--i + t that will hit La before time tk- A branching event 
at (tfc_i + t,y) produces, on average, h(t,y) 2 pairs of particles that hit La and have their most 
recent common ancestor at time t. Since each particle branches at rate 1, 

/>(log N)^6s rL A 

E[Y 2 X }= / / q t {x,y)h(t,yf dy dt. (74) 

Jo Jo 
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Since h(t,y) < h(0,y), it follows from Proposition 1161 that 

, Ce A 9e^ (iry\, , * . Ce A e^(l + o(l)) 

^ y) * FooiivF sm U) (1 + 0(1)) + mkw 

Ce A (l + o(l)) / m (ny\ 

where the o(l) term tends to zero uniformly in y as N — > oo. 

We first evaluate the portion of the integral in (|74|) when t < (logiV) 2 . Recall from (|28p that 
when t < (log N) 2 , 

q t (x,y)<Ce^-^v t (x,y), 

where vt(x, y) is the density of Brownian motion in the strip (0, La), defined as in (|14p with La 
in place of K. Therefore, changing the order of integration, 

-.(logTV) 2 r L A 

Qt(x,y)h(t,y) 2 dy dt 



Ce 2A e^(l + o(l)) f A ( av . iry\ e™ \ 2 _ uv [ {logN) , , A 
< ATO/1 v -\t\ a / \Qz m sin -r - + ; 77 e w / v t (x,y) dt) dy. 

iv2(io g ^)4 y v fogivy V7o y 

By ([51]), J °° «t(a;,y) dt < 2(L A - y). Using also that (a + b) 2 < C(a 2 + b 2 ), that sin(iry/L A ) = 
sin(7r(L A - y)/L A ) < n(L A - y)/L A < C(L A - y)/Qog N), that = JV(log iV) 3 e - A (l + o(l)), 
and that (|32|) holds, we get 

(log AO 2 , 

q t (x,y)h(t,y) 2 dy dt 
„ Ce 2A e^(l + o(l)) y LA (6 2 eW{L A - yf e m {L A - y)\ , 

- A^iogAO 4 y v o^v? + oogiv) 2 J y 

Ce 2A e/-(l + (l)) ^ < Ce^(l + o(l)) 

" iV2( logA r)6 { } ~ N{\ogNf ' 1 ; 

When t > (logiV) 2 , Lemma [5] implies that 

q t (x,y) < Cp t (x,y)(l + o(l)) < C( ^ (1)) e^sin fe) e -^sin 
because A > 0. Therefore, 

"(logAf) 3 6»s f-L A 
'(logJV) 2 JO 



q t (x,y)h(t,y) 2 dy dt 



7TX 



Ce 2A (l + o(l)) 
iV 2 (logiV) 5 \L A 



(logTv) 2 Jo \LaJ V v^a/ fogivy 



iV 2 (logiV) 2 1 \LaJ Jo \ (logiV) 3 (logiV) 3 

0e A (l + o(l)) , x (*x\ 
N(logN) 2 Sm {L A )' 
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using that the last integral can be bounded by Ce^ LA /(\og N) 3 < Ce~ A N. By combining (JTSJ) 
and ([77|) and summing over the contributions from different particles, we get on Gn^-i, 

^PVxl ^ { N(logN) 3 + Ar(i og Ar)2 J C 1 + o C0) < Ar(i ogiV ) 2 + ( 78 ) 

where the last inequality uses Lemma [TUl The result now follows from (|72|) . (|73p . and (|78p . using 
the assumption from (|35|) that ^e^e -1 / 2 < 1. □ 

Remark 19. By Proposition \16\ and Markov's inequality, we see that when 8 is small, during 
most intervals [t k -i,t k ], no particles reach La- Using Propositions [77JI and [TS\ and the second 
moment method, we get that on Gjv.fe-l; 

{E[R k \F tl A) 2 
P{R k >Q\F tk _ 1 )> y ! ' ' 1 



^[^fcl^fc-i] 

A> 



iV(logiV) 2 ) U A ^-i J( ' 



Cee A Z N (t k _ 1 )(l + o(l)) 
N(logN)< 



Thus, it follows that 



E[R k \R k > CJVJ = Bzf^W^ ^ ^ ^ + 

P\Rk > U|j%_J 

TTiat is, conditional on the event that at least one particle reaches La, the expected number of 
particles that reach La is bounded by a constant that does not depend on 9 or A. 



3.3 The probability of G N (e) 

We have now acquired enough tools to prove that the probability of Gn (s) is close to 1 when e is 
small and N is large. It is this result that allows us to work on the event Gn throughout much 
of the paper. 

Throughout this section, we will assume that particles are killed upon reaching La- Define 
Zjs;{€) and Ypfit) in the same way as Z^{t) and Y/v(i) in @ and (fTUj) . but for this modified 
process in which particles are killed upon reaching La- Also, we use La rather than L in the 
definition of Zjv(i). 

Lemma 20. For any fixed A G R and any fixed t >0, under the hypotheses of Proposition^ we 
have 

lim P(Y N (t(logN) 3 ) > N{logN) 3 h{N)) = 0. 

W->oo 

Proof. Let be the event that all particles at time zero are in (0, La)- We have P{Bn) —> 1 
as N — y oo because on the event that there is a particle to the right of La at time zero, we have 
Fjv(0) > N (log N) 3 h(N) by ((3TJ), and Y N (0) /(N (log N) 3 h(N)) converges in probability to zero 
by the definition of h(N). On Bn, we have 1jv(0) = Yjsr(0), so the result holds when t = 0. 
Suppose instead t > 0. By (fT8j) . on B^ we have 

E[Y N (t(logN) 3 )\F Q ] = -e^/ 2 -^/ 2L An^ N ) 3 (l + D)Z N (0), 
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where \D\ is bounded by the right-hand side of (|16p with La in place of K. Therefore, using 
(1391) . on Bn we have 



E[Y N (t(logN) 3 )\F ] < Ce- 27r2 ^( 1+ °W)^(0)(l + O (l)). 
Therefore, by Markov's Inequality, on Bn we have 

E[ifr(*(io g #) 3 )|.Fo] 



P{Y N {t{\ogNY) > iV(log N) h(N)\J r o) < 



< 



iV(log N) 3 h(N) 
C e -2T 2 ^(i+°(i))^ JV (0)(l + o (l)) 



N(logN) 3 h(N) 

The right-hand side converges in probability to zero because Z7v(0)/iV(log N) 2 converges in dis- 
tribution to v and (log N)h(N) ->ooasiV-> oo. The result follows. □ 

For the rest of subsection 13.31 we will assume that A < 0, so that La > L. 

Lemma 21. Fix A < and t > 0. For all k > 0, there exists a positive constant C\, depending 
on k but not on A or t, such that under the hypotheses of Proposition^ 

p( max Z N {r)>-e- l l 2 N{\ogN) 2 \<K + C l e 1 / 2 e- 2 ^ At{1+0 ^. 

Proof. By if U(r) = e~^~^ '' /2 ~^ ''/ 2L a> Z N (r), then (U{r),r > 0) is a martingale. Since 

A < 0, we have 1 — fJ> 2 /2 — ir 2 /2L 2 A > 0. Therefore, by Doob's Maximal Inequality (see, for 
example, the p = 1 case of Theorem 1.4 in |24j ) and (j39|) . 

P( max Z N (r) > -e^A^log N) 2 Fq] 

V0<r<t(logAf) 3 2 / 

•f max C/(r) > i e - 1 /2 A r(iogAr)2 e -(i-^/2-^/2Li)t(iog7V)3 

2Z JV (0) g 1 / 2 e( 1 -^/ 2 "" 2 / 2L A)*a°g^) 3 2Z jV (0) £ 1 /2 e -2^^(i+o(i)) 
- N(logN) 2 - iV(log iV) 2 " 

Because the distribution of ZAr(0)/A^(log A^) 2 , and therefore that of Zjv(0)/iV(log iV) 2 , converges 
to v, there exists a constant Ci such that P(Zjy(0)/N(log N) 2 > C±) < k. The result follows. □ 

Lemma 22. Let k > and t > 0. Under the hypotheses of Proposition [ZJ t/iere exist positive 
constants C2 and 7, depending on k and t, such that for all A < 0, t/ie probability that some 
particle reaches La before time t(log AQ 3 zs at most C2e 7j4 + k /or sufficiently large N . 

Proof. Let J = [47r 2 t]. For 1 < j < J, let A,- = 2^- J ,4 and Sj = (j/47r 2 )(log AQ 3 . Let s = 0. 
Consider a modified branching Brownian motion, defined up to time t(log A^) 3 , in which particles 
that reach La } between times Sj—\ and Sj are killed. Because L^. < La for all j, it suffices 
to bound the probability that at least one particle gets killed in this new modified branching 
Brownian motion. 

Let Mjsr(r) denote the number of particles alive at time r, and denote the positions of these 
particles by X\^{r) >■■■> X MN ( r ^ N (r). For r G [sj-i,Sj], define 

Z Nd (r)= £ e^«sin(^M 
i=l ^ A i 
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and 

M N (r) 

Y N (r) = e ^'" (r) ' 
i=i 

For all r G [sj-i,Sj], we have, using (fT9l) and (f39l) . 

irr* / m tt i M/W/2iJ )(My.i)= , , 
< e -2^^ (Sj _ Sj _ l)(1+o(1) ) /(log jv)3 ^ 

= e -A,(l+o(l))/2^. (Sj _ i) _ (8Q) 

This bound allows us to bound the probability that a particle reaches Lj^. between times Sj-i 
and Sj using Proposition [TUJ We divide the interval from Sj-\ to Sj into smaller subintervals of 
length approximately #(logiV) 3 . More precisely, define times Sj-\ = uq < u\ < • • • < ud = sj 
such that #(log N) s < Uk — Uk-i < 2#(log AQ 3 for all k, which is possible as long as we choose 
9 < 1/47T 2 . Letting Rj^ denote the number of particles that reach La. between times Uk-i and 
Uk , we get from Proposition [IBJ 



M p i-r ^ Ce A ^K-i)fl(l + o(l)) Ce A >Y N (u k .{)(l + (1)) 
[ J ' fc| Ufc - lJ " N{\ogN) 2 + iV(logAf) 3 

By Markov's Inequality, 

F(i? ijfe > ) < N{logN) 2 + min ( iV(logiV)3 > 1 

By Lemma [20l applied at the times uq, . . . ,ud, the second term is o p (l), which means it tends to 
zero in probability as N — > oo for any fixed values of the parameters A and 6. Therefore, 



P{R hk > 0|.F s ._i) = E i p ( R i,k > 0|.F„ fc _i)l-F«j-i] 

Ce A iE[Z NJ (uk z ^\Ts l _ 1 }e(l + o(l)) 
N(logN) 



^ A7"\2 + OpW- 



Let = Y£=i Rj,k- By <JBDD and the fact that D < C/6, 

P(i?, > 0|^_J < /V(logiV)2 + ' ' 



Let Gj be the event that YAr(sfc) < N(logN) 3 h(N) for fc = 0, . . . , j, and let G = Gj-i. We 
have -P(G) = 1 — o(l) by Lemma [201 We now show by induction that for j = 1, . . . , J, on Gj_i 
we have 

E[Z^( S ,)l^o] < Ge-^ 1+ -+^)( 1+ °W)/ 2 ^(0) + o(iV(log AO 2 ). (82) 

The j = 1 case follows from JSU and the fact that \Z NA (0) - Z N (0)\ < C\A 1 \Y t f(0)/QogN) 
by (f38l) . the difference between the expressions coming from the fact that La 1 is used in the 
definition of Zjvi(O) and L is used in the definition of Zjv(0). Suppose the result is true for j — 1. 
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On Gj-i, we have, using ([80]) and the argument leading to (f38|) . 

< e-^( 1 +°( 1 ))/ 2 Z A r ) ,_ 1 ( Si _ 1 ) + e-^+^lZxjis^) - Z^-ifo-i)! 

< e-^^ 1 ))^--!^--!) + Ce-^( 1+ °W)/ 2 |^ - ^_ 1 |^ v ( s ,_ 1 )/(logiV) 

< e-W-KtWZNj-xisj-i) + o(N(logN) 2 ). 

Taking conditional expectations with respect to J-" so and applying the induction hypothesis gives 
The result (|82p for all j = 1, . . . , J on Gj-i follows by induction. 
We now take conditional expectations with respect to T so on both sides of (|8"Tj) . Using that 
\Z]yj(sj-i) — ZN,j-i{sj-\)\ = o(N(log N) 2 ) on Gj-\ as shown above and that 



Aj - {Ax + • • • + Aj-_i) = 2i- J A - (2 J - 2)2~ J A = A x = 2 X ~ J A = A 



we get 

P(P, > 0|J- S0 ) < A^bgJVT p( } 

C e (^-(^+-+^-i))(i+°W)/2^(0)(i + o (i)) 

^foiivp + 0p(1) 

ge Ai(l+o(l))/2 Zjv(0)(1+o(1)) 

S iV(logiV) 2 + ° M j - 

Therefore, the probability, conditional on .F so , that some particle reaches A = Aj by time 
i(logiV) 3 is at most 

CJe A 2 -^(l + o(l)) Zjv(0)(1+o(1)) 

iV(logiV) 2 + ° pl j - 

Since Z/v(0)/iV(log iV) 2 converges in distribution to ^ as iV — > oo, there is a constant c such that 
P(Z7v(0)/iV"(log iV) 2 > c) < re/2 for sufficiently large N. The result follows. □ 

Proposition 23. Under the hypotheses of Proposition^ we have 



lim sup ( limsup(l — P{Gn{£))) ) = 0, 



where the supremum is taken over all values of 6 > such that 9 G N. 

Proof. Let < e < 1, and let k > 0. Choose Ci as in Lemma [2"T1 and choose 7 and C2 as in 
Lemma \22\ with u + s in place of t. Choose A = (log e)/(8-7r 2 [u + s)) < 0. We now assume that 
9 is small enough that assumptions ()32j) - f)35j) hold for these choices of e and A, so that previous 
results in this section may be applied. This assumption is permissible because dividing 9 by a 
positive integer to make it small enough to satisfy these conditions can only reduce the value of 
P(Gjv(e)) by adding additional times at which conditions on Y/v and Zn must hold. 

By Lemma [22l the probability that some particle reaches La by time (u + s)(logA r ) 3 is at 
most 

K + C 2 e 7(loge)/(8 ^ {u+s)) (83) 
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for sufficiently large N. By Lemma [20l 

lim P(Y N (tj) > N(logN) 3 h(N) for some j < 9' 1 ) = 0. (84) 

N— >oo 

By Lemma I2T1 

limsnp p(z N (tj) > V 1/2 iV(logA0 2 for some j < e~A < K + C 1 e 1 / 2 e-0°* e X 1+o Q)V A 

< K + Cl£ (l+ (l))/4_ 

Using (|38p . on the event that Y^(tj) < N(logN) s h(N) and no particle reaches La by time 
(u + s)(log A^) 3 , we have 

z N{tj) < + ^MWW < 2w(tj) + i E -v 2w(logAr)2 

for sufficiently large N . Thus, 

lim sup P(Z N (tj) > e -1 / 2 AT (log AT) 2 for some j < O' 1 ) 

<2K + de 1/4 + (- e 7(log £ )/(8^(u+s))_ (g5) 

As Y^j{tj) = Y]y(tj) and Z^{tj) = Z^{tj) when no particles reach La by time (u + s) 3 (log N) 3 , 
we see that 1 — P(Gn(z)) is bounded by the sum of the probabilities in (i83|) . ([Ml) , and (|85|) . 
Since none of the bounds depends on 0, it follows that 

lim sup sup I lim sup (1 — P(Gjv( £ ))) ) < 3k, 
and the result follows by letting k — > 0. □ 



4 Critical branching Brownian motion with killing at —y 

Consider a branching Brownian motion with drift — v2 started with a single particle at 0. From 
this process, a modified process can be constructed in which particles that reach — y are killed. 
Let Z y denote the number of particles that reach — y and are killed. Note that Z y has the same 
distribution as the number of particles that hit zero in branching Brownian motion with drift 
— \/2 and absorption at zero, started with a single particle at y. Because this process almost 
surely goes extinct by Theorem 1.1 of [43] , and it is easy to verify that infinitely many particles 
will not reach the origin within any finite time interval, we see that Z y is almost surely finite for 
every < y < oo. Furthermore, because each particle that reaches — x behaves thereafter like 
another particle started at zero, the number of particles that reach — (x + y) conditional on Z x 
is the same as the distribution of the sum of Z x independent random variables with the same 
distribution as Z y . Consequently, the process (Z y )y>o is a continuous-time branching process, as 
is shown in section 5 of [56] . As noted in [56J and in the more recent work of Maillard [53] , this 
branching process is not in the LlogL class. However, the following proposition appears on p. 
238 of [56]. 
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Proposition 24. There exists a random variable W such that almost surely 

lim ye~^ y Z v = W. 

Furthermore, for all m£|, we have 

E[e-^ 2uw ]=^(u), (86) 
where ip : R — > (0, 1) solves Kolmogorov's equation 

^V" -vV = V(i-^)- 

Corollary 25. Let 77 > 0. There exists y such that 

P(\ye-^yZ y - W\ > 77) < 77. (87) 

Moreover, there exists £ > suc/i i/iai if particles are killed when they reach —y, the probability 
that any particle remains alive after time £ is less than r\. 

Proof. Equation (|87p is immediate from Proposition [25J The second statement follows from the 
fact that Z y is almost surely finite, and therefore so is the time when the last remaining particle 
hits —y. □ 

Our goal in this section is to show that P(W > x) ~ B/x as x — > 00 for some constant B. 
The strategy will be to consider the Laplace transform i?[e~ AW/ ] for small values of A, and then 
apply a Tauberian theorem. From (186 j) . we see that this requires having asymptotic results for 
ip(u) as u — y —00. Equivalently, if we define w(x) = ip(—x), then 

\j" + V2w' + w(w - 1) = 0, (88) 
and we are looking for asymptotic results for w(x) as x — > 00. It is well-known that 

1 - w(x) ~ Cxe~^ x ; (89) 

see, for example, (11) in [47] or (1.13) in [T7]. However, this result turns out to be insufficient 
for our purposes. The asymptotic result that we will need is given in the proposition below. 

Proposition 26. Suppose that w is an increasing function satisfying 188\) with linx^oo w(x) = 1 
and lim^^oo w(x) = 0. For all x, let u{x) = 1 — w(x) and v(x) = u(x)/(xe~^ x ). Then for all 
c > 0, we have 

lim x(v(x + c) — v(x)) = 0. 

Proof. Let x > 0. Let (Rt,t > 0) be a three-dimensional Bessel process with Rq = x. According 
to (2.6) of [38], the process 



X t = v(R t ) exp f - J u(R s ) ds^j 



38 



is a positive local martingale, and therefore a supermartingale. Let T = inf{t : Rt = x + c}. By 
the Optional Sampling Theorem, 



v(x) = E[X ] > E[X T ] = v(x + c)E 



exp 



T 



which means 



v(x + c) — v(x) < v(x + c) I 1 — E 



exp 



u{R s ) ds 



u{R s ) ds 



Let < 7 < 1, and let A be the event that Rt < jx for some t < T. That is, A is the event 
that the Bessel process reaches jx before reaching x + c. By Corollary 3.4 on p. 253 of [61], we 
have 



P(A) 



(x + c) 1 — x 1 C7 



(a; + c) -1 — (7a;) -1 c + (1 — 7)x 

In view of ([89]) . there are constants Ci and C2 such that for sufficiently large x, we have v(x + c) < 
C\ and 

max u(y) < C 2 xe~^ x . 

-yx<y<x+c 

It follows that for sufficiently large x, 

v(x + c) — v(x) < C\E 



1 — exp 



u(R s ) ds 



< C X E 



u + 



J u(R s ) ds^j 1a< 



< dP(A) + C x C 2 xe-^ x E[T}. 

To bound E[T], note that using ^ to denote expectation for the Bessel process started at x, 
and t z to be the first time that the Bessel process hits z, we have -Eol^+c] = -Eb[ T x] + ^asbaH-c] 
by the Strong Markov Property. Therefore, E[T] = E x [t x+c ] < Eo[t x + c ]. Furthermore, the 
three-dimensional Bessel process is the Euclidean norm of three-dimensional Brownian motion, 
which is bounded below by the absolute value of the first coordinate, which is a one-dimensional 
Brownian motion. Therefore, E'ofr^+c] is at most the the expected time for a one-dimensional 
Brownian motion to reach — (x + c) or x + c, which for sufficiently large x is at most C3X 2 for 
some constant C3. It follows that 

limsupx(u(x + c) - v(x)) < limsup ( x ■ C x — — + C^^e'^A = 

x^oo V C + (1 - 7JX J 1 - 7 

Because this holds for any 7 > and C\ does not depend on 7, the result follows. □ 

Proposition 27. Let W be the limiting random variable in Proposition \24\ Then, there exists a 
constant B > such that as x — >■ 00 , 



P(W > x) 



B 



x 
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Proof. Let 4>(\) = E[e~ xw ]. According to the discussion on p. 335 of [12], the condition that 
P(W > x) ~ B/x as x — > oo is equivalent to the condition that the function f(z) = z(l — cj)(l/z)) 
has S-index 1, meaning (see p. 128 of [12]) that for all r > 1, we have 

lim (f(rz) — f(z)) = Blogr. 

That is, P(W > x) ~ B/x is equivalent to the condition that for all r > 1, we have 

lim rz(l — (p(l/rz)) — z(l — <p(l/z)) = Blogr, 

z—too 

or equivalently, letting A = l/z, 

hf (l- W r))-(l-^)) =Bloer 

A— >0 A 



Consequently we need to show that (|90p holds for all r > 1. 

By ([86]) . we have 0(A) = ^((log A)/\/2)- Let w(x) = i/j(—x), so w; satisfies ([88]) . For all x, let 
u(x) = 1 — w(x) and v(x) = u(x)/(xe~^ 2 ~ x ), as in Proposition [26l Then 



log A A /-log A 
Likewise, 

4o g(r/A) A Aog(r/A) A A 
v 7 ^ J\ V2 Jr 
Letting x = (log(l/A))/\/2 and c = (logr)/\/2, it follows that 



1 — 4>(X/r) = v 



r(l-0(A/r))-(l-0(A)) _ / log(r/A) A Aog(r/A) A / log(l/A) A / log(l/A) ' 
A V \ V2 J\ V2 J \ V2 J\ V2 

= v(x + c)(x + c) — v{x)x = x{v{x + c) — v (x)) + cv(x + c). 

As x — )• oo, we have v(x + c) — )• C, where C is the constant from ([59"]) . and x(v (x + c) — 1> (x)) — > 
by Proposition [26j Therefore, 

,. r(l - 0(A/r)) - (1 - 0(A)) . , , . ... . Clogr 

lim — y ' '\ — -^—^ = lim x(t>(x + c) - v(x)) + cv(x + c) = 

A^O A x-*» l y/2 

so ([90]) holds with B = C/^2. □ 

We will see later in the proof of Proposition 1411 that B = 1/V2. 

Corollary 28. There is a constant C such that P(W > x) < C/x for all x, and E\W1jw< x \] < 
Clogx and E[W 2 l{w<x}] < Cx for all x > 2. 

Proof. The first statement is immediate from Proposition [27] Since 

E[Wl {w < x} ]< P{W>y)dy<l+ -dy = l + C\ogx 

Jo Ji y 



and 



E[W 2 1 {W<X} ] < [ 2yP(W > y) dy < 1 + 2 [ y • - dy < 1 + 2Cx, 

Jo Ji y 



the other two statements follow. □ 
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5 The particles after hitting La 



Recall that in subsection 13.14 we obtained estimates on the number of particles in branching 
Brownian motion that never reach the level La, while in subsection 13 . 21 we estimated the number 
of particles that reach La- In this section, we determine how much the descendants of the 
particles that reach La will contribute to the process at later times. The basic strategy will be 
to argue that if a particle reaches La, then the number of descendants that it will have in the 
population a long time into the future can be approximated by the number of its descendants 
that reach La — y, where y is some large constant. The number of descendants that reach La — y 
can be approximated using the random variable W in Proposition [241 



5.1 Notation and constants 

Recall from subsection 3.2 that R k particles reach La between times t k _\ and t k . By Propositions 
[TBI and [T8| on Gjy.k-i we have 

E[R k \ .7VJ < Cee A e- 1 / 2 + o(l) (91) 

and 

£[i^|JVJ < C9e A e^ 2 + o(l). (92) 

These moment estimates will be used repeatedly in what follows. Denote by u\ < U2 < ■ ■ ■ < ur, 
the times at which these particles reach La- Recalling ([9]) and (I36p . define 



i=i 

Note that 



ZN(tk) = ZN,l(tk) + ZN,2{tk), 



and the particles contributing to Z^^{t k ) are precisely the particles at time t k that are descended 
from the particles that reach La at one of the times u\ , . . . , UR k . 

Our aim in this section will be to estimate, on G]y t k-i, the expectation 

E[(Z N (t k ) - ZAr(t fc _i))l( Zjv ( tfc) _ Zjv ( tft _ l) < eAr ( log7V) 2}|J r tfe _ 1 ], (93) 

as well as probabilities of the form 

P{Z N (t k ) - Z N (t k ^) > rN (log AOVtfc-J (94) 

for r > e. We apply the truncation at eN(log N) 2 to focus separately on particles reaching La 
that make a small addition to the value of the process, whose contributions are counted in ([93]) . 
and particles reaching La that lead to large jumps in the value of the process, an event whose 
probability is estimated by (f94"|) . 

Estimating these quantities precisely will involve manipulating seven constants. Recall that 
we have been already working with the three constants s, A, and 9. Throughout this section, e 
will be a fixed number with < e < 1. We will also introduce a new constant 5 > and in fact 
will fix 

5 < e 7 . (95) 
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By Proposition 1271 one can choose x large enough that if z > x, then 



: i - { ) B < m >, ) <(l±^, (96) 



z z 
where B comes from Proposition 1271 We will then choose A > 1 large enough that 

2\/2vre" A 2; < e, (97) 
4e" A/9 < 5/6. (98) 

Once e, (5, and ^4 are chosen, we will choose 9 > small enough to satisfy the following equations: 

AO < 1, (99) 

4vr 2 J 4^ 1 / 2 < e"^ 4 , (100) 

49 1 / 4 < 5/6, (101) 

fl 1/4 e A < 5, (102) 

M 2 e A e" 1/2 < 1, (104) 
Co^ 1 / 2 < 1, (105) 

where Co is a constant to be defined later in (|112p . Note that (|99p and (llOOh were already 
assumed in (|55|) and ([55]). while (|104|) implies ([55]) because A > 1. In this section, we will also 
work with the additional constants 77, y, and £ from Corollary 1251 We will choose r/ = 9. We will 
then choose y to be large enough to satisfy both (|87|) and the equation 

1 < 9y. (106) 

We finally choose £ to satisfy the conditions of Corollary [25] for these values of 77 and y. 

Consider the particle that reaches La at time Uj G (£fc—i,ifc]. Denote by Vj & the number 
of descendants of this particle that, at some time i > Uj, reach — y + (i — Uj)(v2 — A*) 
and have the property that, for all u S [uj,t), the ancestor of this particle was in the interval 
(La — y + (u — Uj)(\^2 — /x), 00). This is equivalent to the number of descendant particles that 
would reach La — y + (t — Uj)(y2 — y) at time t for some t if particles were killed upon reaching 
this level. Denote the first times at which these Vjk particles reach level La — y + (t — Uj)(y/2 — fi) 
by r i,j,k < r 2,j,k < ■ ■ ■ < ry. k j t k- Note that Vj^ has the same distribution as the random variable 
Z y of Proposition [Ml and the adjustment of (t — Uj)(y/2 — y) is necessary because particles drift 
to the left at rate (A, rather than at rate v2 as in the setting of Proposition [MJ Now let 

W' j>h = ye^V hk . 

By Corollary [25l there exists a random variable Wj j- with the same distribution as the random 
variable W in Corollary [251 such that P(\Wj k — Wj^\ > V) < V- Furthermore, it is clear that 
for fixed k, conditional on J : t k _ 1 and conditional on R k = r, the random variables W[ k , . . . , W' r k 

are independent and have the same distribution as ye Likewise, the random variables 

Wj t k can be chosen such that conditional on J~t k _ 1 and conditional on R k = r, Wi^, • • • , W r & are 
independent and have the same distribution as the random variable W in Corollary [ 
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5.2 The contribution of one particle at La 

In this subsection, we show that the contribution to Z^j^itk) from the jth particle to hit La 
can be approximated by ny/2e~ N (log N) 2 Wj ik - As a result, typically Z^(t k ) — Zjv(ifc-i) > 
eiV(log N) 2 precisely when Wj^ > e/(7r\/2e ) for some j < R k . Establishing the validity of 
this approximation requires bounding the probabilities of several unlikely events. 

Lemma 29. Let B\ be the event that there exist ji,j2 < R k with j\ 7^ j% such that Wj uk > e 2A / 3 
and W htk > e 2A / 3 . Then on G N>k - lt we have P{B 1 \T th _ 1 ) < C6e^ A / 3 e- 1 / 2 + o(l). 

Proof. Conditional on Tt kl and R k , the expected number of pairs (ii, J2) with j\ / j 2 such that 
Wj lt k > e 2A//3 and Wj 2jfc > e 2A / 3 is (^ fe )P(iy > e 2A//3 ) 2 , where W is the random variable defined 
in Corollary [251 By Proposition EH P(W > e 2A / 3 ) < Ce~ 2A ^, so 

PiBi]^) < CE[R 2 \F tk ^]e- iA l 3 < Cee~ A lh~ 1 ' 2 + o(l), 

where the last inequality uses (|92p. □ 

Lemma 30. Fix r > e, and let B2 be the event that 

r - 4e" A / 4 - e" A / 9 - 40 1 /* r + 4e -A/4 + 40 i/4 

< Wjj, < 



ir\/2e~ A " 7r\/2e~ A 

/or some j < On Gjvfe-i; Ziawe P(i?2|^ r t fc _ 1 ) < CQbe^l 2 + o(l) ; where the constant C 
does not depend on r. 

Proof. Let 7 = 4e" A / 4 + e" A / 9 + 46* 1 / 4 . Note that 7 < 5/2 < e/2 because 4e" A / 4 < 4e~ A / 9 < 5/6 
and 46* 1 / 4 < 5/6 by ([93]) and (fTUTJ) . Assume 2 is chosen so that ([96]) holds for z > x. By (J97D, we 
have (r - 7)/(7r\/2e~ A ) > (e - 7)/(7rv / 2e~ A ) > x. Therefore, 

< W < T + 1 \ < B ^ + 6 ^^ e ~ A B ^ ~ S )^e~ A 



7rV2e~ A TrV2e~ A J r-7 r + 7 



7 t ~\~ 7 y v r 2 — 7 2 / s 2 

It follows from this bound and Markov's Inequality that P(B2\J r t k _ 1 ) < Ce~ A 6e~ 2 -E[i?fc| J 7 ^ _ ]. 
The result now follows from (I9T1). □ 



Lemma 31. Let -B3 be the event that for some j, the particle that reaches La at time Uj has 
a descendant that at some time t G (uj,tk\ reaches La — y + (t — Uj)(y2 — /j,), and that this 
descendant itself has a descendant that reaches La before time t k . Then on Grjv,fe-i> we have 
P{B z \F tk _ l )<Ce A eV 2 e- l l 2 + o{l). 

Proof. The particle that reaches La at time Uj has Vj ;k descendants that reach La — y + (t — 
Uj){y2 — /x) at some time t > Uj. Let Aj tk be the event that one of these particles reaches 
La — y + (t — Uj)(y/2 — fi) at some time t > Uj + £. By Corollary 1251 and Proposition [27} since 

9 = n < 1, 

P(A j>k U {W' hk > 6- 1 ' 2 } for some j < fl fc |.F tfc _i) < E[R k \F tk _A{2q + P(W > #~ 1/2 - r?)) 

<CP[ J R fc |J- tfc _J(?? + ^). (107) 
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At most y 1 e'^ 2y W'- k descendants of the particle that reaches La at time uj will hit La — y + (t — 
Uj)(y2 — n) at some time t < t^. This is an upper bound rather than an equality because some 
particles may reach this level after time tk . We now consider N large enough that y > C(V2-/u). 
On the event A c - k n {Wj k < 9~ 1 ^ 2 }, the probability that a descendant of one of these particles 

reaches La by time tk can be bounded above by y~~ 1 e^ 2y 9~ 1 ^ 2 times the probability that a single 
particle at La — y + C(V% — m) ^as a descendant that reaches La by time (log N) 3 6s. Using 
Markov's Inequality to bound this latter probability by the expectation of the number of such 
descendants, it follows from Proposition [16] that the probability is bounded above by 

Ce A f oM L A -y + «V-2-»))^J^ L A-y + aV2-n))\ , e ^-s/+C(v^)) 



e ^A-y+CW^)) sin 8 ^' ILL + (1 + G (l)) 



A^(logiV) 2 V V L vi / logiV 

Note that we are applying Proposition [16] in the case when k = 1 and there is just a single 
particle initially at the location L A - y + ((V% - /i). Since e^ hA = N(logNfe' A (l + o(l)), 
sin(7r(L A -y + C(\/2-/i))/L j4 ) < (Cy/logiV)(l + o(l)), and e ^^~^ is l + o(l), this expression 
can be bounded above by 

Ce A 

(Oye-^N (log N) 2 e~ A + e _w JV(log iY)V A ) (1 + o(l)) < Ce~ m (9y + !)(! + o(l)). 



iV(logiV) 2 
Combining these observations gives 

P(S 3 | Ji^) < C£[i? fe | JVJ fa + + y-^d- 1 ' 2 ■ e-W(9y + 1)) (1 + o(l)) 
< CS[J2 fc | ^.J fa + VO + e- l ' 2 y- l ){l + o(l)). 

The result now follows from (|9ip and the assumptions that rj = 9 and 1 < 6y. □ 

Recall that the particles at time tk contributing to Z^^(tk) are precisely the particles at 
time tk that are descended from the particles that reach La at one of the times u± , . . . , UR k . To 
separate the contributions from each of these particles, write i G <Sj if the particle at Xij^itk) at 
time tk is descended from the particle that was at La at time Uj. Then for 1 < j < Rk, define 

ZN,2 d (t k ) = ^-^sin f^jtal^}. (108) 



Note that Zjv,2(*jfc) = Sj^i %N,2,j{tk)- The next lemma shows that Z^,2,j(tk) is approximately 
determined by the random variable Wj- 

Lemma 32. Let B,± be the event that for some j < Rk, we have 

\Z N ,2,j(t k ) -7rV2e~ A N (log N) 2 W j:k \ > AN (log N) 2 9 1 / 4 . 
On G N , k -l, we have P(B A \F tk _ 1 ) < Ce A 9 b / A e' 1 / 2 + o(l). 



Proof. Define a new random variable Z' N2 j(tk) by modifying Z^ t 2,j(tk) i n the following three 
ways: 

• We set Z' N2 ^(tk) to zero if uj > t k - (log Nf' 2 . 
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• We set Z' N2j (t k ) to zero if r Vj>k ,j,k > Uj + C- 

• We modify Sj to exclude particles that, after time uj, reach La — y + (t — Uj)(y/2 — fi) at 
some time t £ (uj,tk\ but then reach La again before time t k . (Note that this modification 
is equivalent to killing particles that reach La after they reach La — y + (t — Uj)(y/2 — fx) 
at some time t > Uj.) 

Then define Z'f^ 2 ■ (t k ) by making these three modifications and replacing L by La in the definition 

(Hdsj. 

By Corollary PTT1 and Markov's Inequality, P(uR k > t k — (log N) 5 / 2 \F tk _J = o(l) on Gm,u-i- 
This implies that the first of the four modifications above is unlikely to occur. By Corollary [25] 
and (HD, 

p (r Vj , k ,j,k >Uj + C for some j|.F tfc _i) < rjE[R k \ Ji fc _J < Cr } 8e A e~ 1 ' 2 + o(l), 

which bounds the probability of the second type of modification. Lemma ED bounds the prob- 
ability of the third type of modification. These results and the fact that rj = 8 imply that on 
Gn,k-\i 

P{Z' N ,2,j{tk) ± Z Nj2>j (t k ) for some j < fl fc |.F tfc _i) < Ce^e" 1 ' 2 + o(l). (109) 

Let Tj be the event that Uj < t k - (logiV) 5 / 2 , that r Vjtk ,j, k < Uj + C, and that W' jk < d' 1 ^. 
The probability that either of the first two of these events fails to occur has already been bounded, 
so using the argument given in (|107l) . on Gjy,t fc _ 1 we have 

(110) 



p({jT<F tk \<Ce A 8^l 2 + o(l). 

V 7 = 1 ' 



Let H k -i = ^(^.pVi^, . . . , V Rk)k ,ui,. . . ,UR k , (ri,j,k)i<i<v jikl i<j<R k )- Note that Tj £ U k -\ for 
all j, and on Tj for sufficiently large N, the Vj± particles that reach La — y + (t — Uj)(V% — /•*) 
for some t > Uj are all reaching a level between La — y and La — y + C(v / 2 — /u) at some time 
between t k -\ and These particles and their descendants then evolve independently until time 
t k , and we kill particles that return to La if we are evaluating Z' N2 -(t k ) or Z'^ 2 -(t k ). 

By the argument leading to (|40p . with the times rij k playing the role of t^—i, on Tj we have 

E[\Z' K2j (t k ) - Z^, 2j -(t t )||«t-i] < V jlk (l + o(l)) 

< C7y -1 JV(logJ\0 2 e _1 / 4 (l + o(l)). 
Therefore, by Markov's Inequality and the assumption ()106|) that 1 < 8y, on Tj we have 
^(l^,2j(**) " ^, 2J (t k )\ > iVQog iV) 2 1/4 |tt fc _i) < Cy-^- 1 / 2 + (i) < C0 1 / 2 + o(l). (Ill) 



Let 

Vi,j,k — 6' 



M(-^A-?/+('-i,j,fc-Mj)(v / 2-/i)) 
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and 

. (n(LA-y + (rij t k-Uj)(y/2-{i))\ 
Zi,j,k = Vi,j,k sin I j-^ I • 

The ith of the Vj jk particles that reach La — y + (t — Uj)(y/2 — fx) for some t > uj reaches this 
level at time r^j^. Therefore, by ([15]) and ([39]) . on 1^ the expected contribution to Z'^ 2 -(t k ) 
from descendants of this particle is given by 



3 (l- M 2 /2- 7 r 2 /2-Li)(i ft -r i , i , 



^ = (1 + 0(A6) + (l))^"V)g (l-O (^4^) ) 



Thus, on Tj, 

E[Z%^(t k )\H k ^\ = Vj, k L^-y)^-{\ + O(A0) + o(l))) 

= ^- k U(log Nfe- A e-^^j (1 + 0(A9) + o(l)). 

= Tyj ifc (^\/2e" A iV(log iV) 2 )(l + O(A0) + o(l)). 

This means there is a constant Co such that for sufficiently large N, on Tj, 

\E[Z , ii^{t k )\U k ^i] - irV2e- A N (log NfW^ < C N (log NfW^AO < C Q N (log N) 2 A6^\ 

Therefore, using (|105p . 

\E[Z^^(t k )\U k ^\ - itV2e- A N(logN) 2 W'^ k \ < N(log NfO 1 ^ (112) 

for sufficiently large N . On Fj we can similarly estimate the variance of the contribution of 
each of these particles. We apply ([4*6|h with the times Ti t j )k playing the role of t k -i- Since the 
descendants of these particles after times nj^, . . . , ry. k) j jk evolve independently, we get 

Var(^ 2J (t fc )l^-i) < CON (log N) 2 e~ A (z i}j;k + (l + o(l)). 

Arguing as above and using ()106|) , we find that on Fj , 

Var(Z^ 2j (i fc )l?4-i) < k 6N(logN) 2 e- A (e^ L -^ sin f ^"^ ) + ^j^f) U + 

< CV,- k 6N (log N) 2 er 2A (ye'^N (log N) 2 + 0~ V w iV(log iV) 2 ) (1 + o(l)) 

< C^j jfc iV 2 (logiV) 4 ^- 2A (l + r^-^Cl + o(l)). 

< CiV 2 (log iV) 4 fl 3/4 (l + o(l)). (113) 
By ()112p . (|113p . and the conditional form of Chebyshev's Inequality, on Fj we have, 

P(\Z'N,2j(t k ) - irV2e~ A N (log N) 2 W!j ik \ > 2iV(log iV) 2 1/4 |?4-i) 

CN 2 (logN)W\l + o(l)) 1/4 

" (JV(logJ\O 2 0V4)2 
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Note that irV2e- A i] < 1 / 4 because A > 0, 77 = 9, S < 1 by flM]), and thus (9 3 / 4 < 1/24 3 by (fTUTJ) . 
Therefore, since P(\W'- k — Wj^\ > 77) < 77, on Tj we have 

P(l^, 2J (ifc) " vrv^e-^aogAO 2 ^! > m ( lo § N) 2 9 1 / 4 \U k - 1 ) 

< C6> 1/4 + 77 + o(l) < Cfl 1 / 4 + o(l). 

Now ([TTT]) leads to 

^(I^W**) " vrv^e-^^aog^) 2 ^-^ > 4iV(l og Nfe^\H k -i) < l r| + C0 1 / 4 + o(l). 

Taking the union over over j < R k and then taking conditional expectations of both sides with 
respect to J 7 t k _ 1 , we get 

P{\Z' N ,2,j(tk) ~ TrV2e~ A N (log N) 2 W j!k \ > m(logN) 2 e 1/A for some j < Rk^t^) 



< 



/ Rk \ 

P { U r i ^-1 )+{Ce llA + o{l))E[R k \F tk _ 1 \. (114) 
^ 7=1 ' 



The result now follows from (fT09t (fTTOjl . and pi) . 
Lemma 33. Let B^ be the event that 

Rk 



□ 



J2 Z N,2,j(tk)l { w jik <e^} > e- A / g N(logNf 



or 



Rk 



3 8A/9 



> 



7T 



V2 



Then P(B h \J : t k _ 1 ) < C(9 5 ^e A + fle"^ 9 )^ 1 / 2 + o(l) on G N>k - X . 
Proof. We have 



{W ilfe <e2A/3 } 

J = l 



= 8A/9 



> 



vrv/2 



tfc-1 > 



(115) 



where /3 = 4e A 6> 1 / 4 /(7r\/2), which by fllQ2|> is bounded by a constant. Let Q k -i = a{Tt k _^Rk)- 
Using Corollary l28l with x = e 2A ^ 3 and (f9T|) . 



Rk 



Var( E(^ + ^ 1 W, fc <e^/3} 



= E[R k Var((W + (3)l { w<e^})\ -^-J 

< + /3) 2 l {w < e ^ /3} ]£[i4|^ fc _ 1 ] 

^Ce^-C^-^ + oa) 
<C0e 5A / 3 e- 1 / 2 + o(l). (116) 
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Likewise, using Corollary [28] and ([92]) 

■ Rk 

{W j}k <e^/S} 



Var(£ 



3=1 



*fc-i 



< Var(i? fc £[(^ + /3)l {w < e2 A/3 } ]|J t& _ 1 ) 

< Var(CAfl fc |.F tfc _. l ) < CA 2 E[R 2 k \F tk _ 1 \ < C6A 2 e A £~ l l 2 + o(l) (117) 
Recall that for all random variables X and a- fields J- and with J 7 <Z Q, 

Ym(X\T) = E[Var(X\g)\T] + Vw(E[X\g]\T). 
Therefore, summing (]116p and (|1 17|) gives 

Var C Y^(Wj, k + p)l {Wj k<e , A/3} 7^) < COe^e- 1 ' 2 + o(l), 

as 74 2 e _2yl//3 is bounded by a constant. Also, using again Corollary 1281 and since 
by (HDU), 



3=1 



<C^(i? fc |Ji fc _ 1 )(/3 + 2A/3) 

< Cee -1 / 2 ^ + o(l) < C + o(l). 
Thus by the conditional form of Chebyshev's Inequality, we get 



R k 



P [ E^ + /3)l {H /, ft < e ^/3 } > 



8A/9 



3=1 



7T 



72 



< cee j e + 0( i) < cee -A/9 e -i/2 + o(1)) 



(e 8A / 9 /7r\/2) 2 



which, combined with (|115[) and Lemma [32l gives the result. 



□ 



Lemma 34. Fix r > e. Consider the event E that Z^itk) — £Jjv(tjfc-i) > r./V(log iV) 2 ; and 
consider the event F that Wj tk > r / (ir^/2e~ A ) /or some j < R k . Let B§ be the event that one of 
these two events occurs but not the other (that is, the symmetric difference of these two events). 
Then P(i?e|J 7 i fc _ 1 ) < C05e~ 5 / 2 + o(l) on G^.k-i, where the constant C does not depend on r. 

Proof. Let B be the event that \Z Njl (t k ) - Zjv(4-i)| > 4e _j4/4 iV(log N) 2 . By Corollary [H and 
Lemmas 1291 [30l I3TI l32l and [33l as well as the assumptions ([98]) and (]102p . we have on Gjvjfc-i, 



p(\jBi Tt^) < C56e- 5 / 2 + o(l). 



Therefore, it suffices to show that 



B 6 c B = |J Bi 



i=Q 



Thus, suppose first u £ E c (~) F, and let us show that uj G B. We have Wj jk > r/(ir\^2e A ) 
for some j < R k . It follows that if u 6 we have W jik > (r + 4 e - A / 4 + 4<9 1 / 4 )/(7r v ^2V A ). If 
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furthermore wGB^ B li we have Z N)2 ,j{t k ) > N(logN) 2 (r + 4e" A/4 ). Now if also oo G B%, we 
have Z N ,i(t k ) > Z N (t k -i) - 4e~ A / 4 iV(log N) 2 , so on B^nB^nB^, we have Z N {t k ) > Z N)1 (t k ) + 
ZN.2,j(tk) > Ztf(tk-i) + rN(log N) 2 , and so E occurs. Since we have assumed that u ^ E, it 
must be that w € B U S 2 U C B. 

Alternatively, suppose 10 G E n F c , hence W 3 -fc < r/(7r-v/2e _j4 ) for all j < It follows that 
on B%, we have W jJk < (r - 4e~ A / 4 - e~ A / 9 - 46* 1 / 4 ) /(vrv^e^) for all j < R k . Then on B c 2 n S|, 
we have Z NX j(t k ) < N(logN) 2 (r - 4e" A / 4 - e~ A l 9 ) for all j < R k . On there exists at most 
one j < R k such that W jtk > e 2A/3 . Therefore, on B c 2 n B% n Bj= n we have 

^(*fc) = J^Awfo) ^ W(log iV) 2 (r - 4e~ A / 4 ). 
3=1 

Finally, on Bg, we have Z N> x(t k ) < Z N (t k -i) + 4e _A/4 iV(log N) 2 , so on nf =0 Bf, we have Z N (t k ) < 
Znih-l) +rN(logN) 2 which means that E does not occur. Since we assumed u G E, it must 
be that cj G L)f =0 Bi = B, which finishes the proof of the Lemma. □ 



5.3 The small jumps 



In this subsection, we estimate the expectation in (|93p . which covers the case in which the process 
Zjv does not jump by more than eN(log N) 2 between times t k —\ and t k . We have 

Rk 

ZN(t k ) - Z N {t k ^i) = (Z N ^i(t k ) - Zjv(tfc_l)) + Z N,2,j(t k )- 

i=i 

Lemma[34]with r = e shows that with high probability, we have Z^(t k ) — Z^{t k -i) > eN(log N) 2 
if and only if one of the random variables W± k , . . . , Wn kk is greater than ej (ir\/2e~ A ). Therefore, 
in view of Lemma l32| we can approximate the quantity in ([93]) by 



S k = {Z N>1 (t k ) - Z N {t k ^)) + 7rV2e- A iV(log^) 2 ^ W j)k l 



i' kL {W jik <e/(irV2e- A )}' 



r M8) 



3=1 



which omits the contributions from terms with Wj tk > e/(ir\' r 2e~ A ). We now calculate the 
expected value of S k and will later justify in Lemma [38] that this is sufficiently close to the 
quantity in (|93l) . 

Lemma 35. On Gjf k—l) we have 

E[S k \F tk _ x ] = Z N (t k . 1 )6s{2^ 2 E[Wl {w ^ /{ ^ A)} ] - 2tt 2 A) 

+ 0(A 2 e 2 e~ 1/2 N (log N) 2 ) + o(N (log N) 2 ). 

Proof. By Lemma fTTj we have on Gjy,fc-i 

E[Z Njl (t k ) - Z N {t k ^)\F tk _ x ] = -Z N (t k ^)(2K 2 A6s + 0{A 2 e 2 ))+o(N(\ogN) 2 ). (119) 

Also, since the random variables Wj jk are independent of one another, and of Tt h _ x and R k , we 
have 

Rk 



E 



Yl W 3' kl {W ]tk <e/(irV2e-A)} 
3=1 



Ft. 



E \ W1 {W<e/(TvV2e-A)}\ E [Rk\Ft k _ 1 ] 
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Combining this result with Proposition [161 we get on G]y,k-i, 

R k 



E 



TtV2e- A N{\og Nf £ W jik l {w . k < e/{7vV 2 e -A )} 
L i=i 

= E[Wl {w ^ £/{ ^ A)] \{2V2^esZ N {t k ^){l + 0{A9)) + o{N {log N) 2 )) . (120) 
Note that from Corollary | 



vr\/2e 



—A 



< 1 + C(loge + A) < CA, 



fl211 



since loge < 0. The result now follows by combining (|119p and (|120p . and using (|12ip to help 
bound some of the error terms. □ 

It remains to bound the expected error that is made when approximating the increment 

(Z N (t k ) - ZAr(t fc _i))l{ Zjv ( tfe) _ Zjv ( tfc _ 1 )< £Ar{logA r)2} by S k . 



Lemma 36. We have E[(Z' N>1 (t k ) - Za^-i)) 2 !^^] < CW 2 (log iV) 4 (e~ A ^ 1/2 + o(l)) 

Proof. By Lemmas 1111 and |12| on Gjvfc-i, 

^[(^,i(*fc) " ^iv(i fe -i)) 2 |^ fc -i] = Var^^l^VJ + (^fefa) " ^(tfc-i)!-^]) 2 

< C6N {log Nfe- A (Z N (t k ^) + o(N (log N) 2 )) + (C^Zjv^-i) + o(N(log N) 2 )) 2 

< C9N 2 (logN) 4 e - A e- 1/2 + C A 2 6 2 N 2 {log Nfe' 1 + o(iV 2 (log TV) 4 ) 

< C#iV 2 (log N)\e~ A e~ 1/2 + A 2 ^ 1 + o(l)), 

and the result follows from (|104p . 
Lemma 37. On k -i> we have 

Rk 



on 



□ 



E 



ZZ W J'' fcl {W^, fc < e /( 7 rV2, 



< C0e 2 V/ 2 + o(l). 



Proo/. Note that e _A / 9 < C<5 < Ce by ([95]) and (J98[). Because A 2 e~ %A / 9 is bounded above by 
a constant, it follows that A 2 < Cee A . Therefore, by ([H]), ([92]), (fT2Tj) . and Corollary EH on 



Rk 



{^■fc^e/CTrv^e-' 4 )} 



^[i? fc |Ji fc _j^[w 2 i {w < e/(7r ^ )} ] +s[/2 fc (fl fc - i)|^ fc _ 1 ]( J B[Tyi 



{W<e/(7rv/2e- A )} 



< (Cfle^V 1 / 2 + o(l))(ee A + A 2 ) < Cfle 2 V/ 2 + o(l), 
as claimed. 



□ 
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Lemma 38. On Gjfk—i> we have 

E[\S k - (Z N (t k ) - ^7v(4-l))l{Z JV (t fe )-Z Ar (t fc _ 1 )<eAr(logAf) 2 }l|- 7: ifc-i] 

< C9N{logN) 2 5 1/2 e- 3 + o(N(logN) 2 ). (122) 

Proof. Throughout this proof, we work on the event Grjvfc— 1< Choose r = e, and recall from 
the proof of Lemma [M] that the event B = uf =0 Bi can also be written as B = U^ =0 Bi since 
Bq C U| =Q Sj. We will bound the following three terms: 



E 



lBc\Sk - (Z N (t k ) - 2 - Ar(tfc-l))l{Z J v(tfe)-^ivfe-i)<eA r (logAf) 2 }l -^tfc-i > ( 123 ) 



lB|(^Ar(*fe) - ^A r ( i fc-l))l{Z JV (t fc )-^jv(t fe -i)<eA r (logAf) 2 }l -^tfe-i > ( 124 ) 

E[l B |S fc ||.F tfc _i]- (125) 
We first bound (fT23i On Bg, we have Zjv(i fc ) - Zv(tjfc-i) > eN(\ogN) 2 if and only if 
Wj',fc > ^/(^V^e"" 4 ) for some j < i?^. In this case, on the event that Wj 0jk > e/(7rv / 2e _j4 ) for 
some jo < -Rfc, the difference between S k and (Z N (t k ) - Z N (t k ^i))^-{z N {t k )-z N (t k _ 1 )<eN{\ gN^} 
will simply be S k , as the latter expression will be zero. However, on Bq, we have 

\Z N ,i(t k ) - Z N (t k -i)\ < 4e~ A / 4 N(logN) 2 . 

By ([MD and the fact that 5 £ (0, e) we have e/(ir^e~ A ) > e 2A/3 . Thus W jo>k > e 2A/3 and on B\, 
for all j ^ jo, Wj <k < e 2A ^ 3 '. Thus, the definition of B§ from Lemma [331 implies that on B\ n B§, 
we have 

R k e 8A/9 



1=1 



Therefore, |S fe | < (4e~ A / 4 + e" A / 9 )iV (log iV) 2 on B c n > e/(vrV2e~ A ) for some j < Rk}- 

If however W^fc < e/(ir\/2e~ A ) for all j < R k , then on B|, the difference between S k and 
(Z N (t k ) - Z Ar (t fc _i))l {Zjv(tfe) _ Zjv(ifc _ i) < EA r (logA r )2} is bounded by 4R k N (log NfO 1 ^. Therefore, 

E[\S k - (Z N (t k ) - ZAr(t fc _ 1 ))l{ Zjv ( tfe) _ Zjv ( tfc _ 1 )< £Ar{logA r)2}|lB^|i : t fc _ 1 ] 

< (( 4e ~ A / 4 + e - A / 9 )P(W j0jk > e/(TrV2e- A ) for some j^Ft^) + AO 1 ' ^[R^Ft^N (log N) 2 . 
Now (HU) gives ElRklFt^} < C6e A e~ 1 / 2 +o(l), and Proposition [27] implies 

P(W jo , k > e/(TrV2e~ A ) for some jol^i) < Ce^e-^^fcl Ji fc _J < Cee^ 2 + o(l). 
Therefore, 

-E[|Sfc - (Z N (t k ) - Z A r(tA:-l))l{z iv (t fc )-Z J v(tfe_i)<eA r (logAf) 2 }l 1 B c |- ;r ifc-i] 

< (C^- 3 / 2 e- A / 9 + C^/Ve- 1 / 2 + o(l))AT(log iV) 2 , (126) 

which gives a bound on ()123p . 

We next bound (I124p . By Lemma [341 and its proof, we have 

P(B\F tk _ 1 ) < C65e- 5 / 2 + o(l). (127) 
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The random variable in (|124j) is bounded in absolute value by max{Zj\r(ife-i), eiV(log A'") 2 }. 
Therefore, on Gjv,fc-i> 

E[\(Z N (t k ) - Z N (t k _i))l {ZN{tk) _ ZN{tk _ l) ^ eN{logN) 2 } \l B \j : tk _ 1 ] 

< P(B\T tk _ 1 )N(logN) 2 £- 1 / 2 < C65e- 3 N (log N) 2 + o(N (log N) 2 ) . (128) 

It remains to bound (|125j) . By the conditional Cauchy- Schwartz Inequality, Lemma [36l and 

(TO, 



E[\Z' NA (t k ) - ^(^OIIbI^J < JE[(Z> (t k ) - Z JV (i fc _ 1 ))2|^_ l ]P(B|^ fc _ 1 ) 



< yJCeNiQogN^e-Ae- 1 / 2 • #fe- 5 / 2 (l + o(l)) 

< C^e-^ 2 ^/ 2 ^ 3 / 2 ^ (log iV) 2 (l + o(l)). (129) 

Likewise, by the conditional Cauchy-Schwartz Inequality and Lemma [37l 



|vrv / 2e- A iV(log AT) 2 £ fc < e / (W 2e^)} I *b 



*fe-i 



< Ce^NilogNfy/ee^e 1 / 2 . e5e- 5 / 2 (l + o(l)) 

< C^iVOogiV) 2 ^^^! + (i)) (130) 

Now Lemma HOI (fT29l) . and (fT30j) imply 

£[|S fe |l B |;Vi] < C9N(logN) 2 5 1 / 2 (e- A l 2 e-^ 2 + e- 1 ) + o(N(logN) 2 ). (131) 

The result follows from (j!26|) . (|128p . and (|13ip in view of the inequality (f98|) and 5 < e, as well 
as (fT02|) . □ 

Proposition 39. There exists a real number c such that 

E[(Z N (t k ) - -^Ar(ifc-l))l{Z J v(t fc )-Z JV (tfe_i)<eAf(logAf) 2 }|- ;r tfc_i] 

= Z N {t k - X )0s{c + 2^ 2 log e + 5 (e, A)) + 0(6N(log AO^V 3 ) + o(iV(log iV) 2 ), 
where g : (0, oo) x (0, oo) —> M. is a function such that lim^oo g(x, y) = for all x > 0. 
Proof. By combining Lemmas and and using (|1U3|) , we get 
E[(Z N (t k ) - Z N (t k - 1 ))l {ZN ( tk y ZNitk _ i) < £N( i ogN) 2 } \j : tk _ 1 ] 

= Z N (t k ^)9 S (2V27T 2 E[Wl {w ^ /i7rV ^ e - A)} } -2tt 2 A) + 0(6N(log Nfd^e- 3 ) + o(N (log N) 2 ) . 

(132) 

Denote the conditional expectation on the left-hand side of this equation by f(N, e, 9). Note that 
this expectation depends on N, e, and 8, but can not depend on 5 or A, as these constants were 
introduced just for the proof. Assume for the moment that k = 1, and the initial conditions are 
chosen so that Z/v(0) = A^(log AQ 2 . Then there exists a positive constant C such that 

N(logN)L ~ ( 2 ^^ W V<*/(^e-)}] " 2 -^) + CS 1/2 e- 3 
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and likewise 

WSfj^^- ( 2 ^ 2 ^V<^e-) } ] - *** A) - C5^e-\ 

We now simultaneously take 5 — > and vl — >■ oo. This can be done without violating the 
constraints on the constants because once 5 is chosen, we can pick A large enough to satisfy (j97|) 
and ([98]) . and then only consider 9 small enough that (f99|) - (|105p are satisfied. The second term 
Cd 1 / 2 ^ -3 then tends to zero. Since the left-hand side does not depend on A, the first term must 
also tend to a limit as A — > oo. That is, we know that 

limj2V27r 2 E[Wl {w ^ £/{n ^ e - A)} ] - 2n 2 A) exists. (133) 
Now let r = e/(ir\/2e~ A ), so 

j = log(7r\/2r) — logs. 

Therefore, the limit in (1133P is equal to 

lim 2tt 2 (V2E[Wl {w < r} ] - log(vr\/2r) + log e) = c + 2vr 2 log e (134) 

for some real number c that does not depend on e. The proposition follows. □ 

Remark 40. Equation ()133|) is a statement which concerns only critical branching Brownian 
motion with absorption and does not depend on N. It would be desirable to find a direct proof of 
this fact, but we were not able to obtain one. This would follow if one could show that 



P(W > 



V2~2 



dx < oo. 



An explicit expression for the value of the limit in (|133j) would also make it possible to identify 
the constant a appearing in the statement of Proposition d 

5.4 The large jumps 

We now estimate the probability in (|94p that the process Zjy makes a large jump between times 
tk-i and t k . 

Proposition 41. For all r > e, on Gjy^-i we have 

P(Z N (t k ) - Z N (t k - X ) > WVaogiV) 2 !^) = ^ • *logN) 2 + °^ 5/2 ) + 
Proof. By Lemma l34"t we have 

P(Z N (t k ) - Zjv(t fc _i) > rJVClogJV) 2 ^,) 

= P( w j,k > r/(TiV2e- A ) for some j\Tt k _ x ) + 0(ede~ 5/2 ) + o(l). 
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Recall that e/(vr\/2e" A ) > e 2A/3 by and the fact that 5 G (0, e). By Lemma[29l for sufficiently 
large A the probability that Wj x ^ > r /(-K^/2e^ A ) and Wj 3) k > r / '(iry/2e~ ' A ) for some j\ ^ 22 is 
at most C6e~ A/3 £~ 1/2 + o(l). Therefore, 

P(W jtk > r/{itV2e~ A ) for some j\T th _ x ) 

= E[R k \F tk _ x ]P(W > r/(7rV2e~ A )) + CKfcT^V 1 / 2 ) + o(l), 

and the error term is smaller than 0(65e~ 5 ^ 2 ) by ([55]) . 

By Proposition 1271 if we use ~ to mean that the ratio of the two sides tends to one as x — > 00, 
then 



PX PX 

E[Wl {w < x} ] = P(y<W <x)dy= P(W > y) dy - xP(W > x) ~ B log 
Jo Jo 



X. 



Therefore, ([TM]) implies that B = l/y/2. Therefore, by ([96 



d-^ <p ^ > ^_\ < (i + 5K 



re A \ TT\/2e~ A J re A 

Combining this result with Proposition 1161 we get on Gjva;— i, 

E^Ft^PiW > r/(nV2e- A )) = ^ • ■ (1 + 0(M))(1 + 0(5)) + o(l), (135) 

which is enough to imply the result. Since 1/r < and Zjy(tfc_i)/A^(log A^) 2 < e^ 1 ^ 2 on 
Gjvfe-i) the dominant error term coming from (H35]) is 0(flfe- 3 / 2 ). □ 



6 Convergence to the CSBP 

In this section, we prove Proposition Q] and Theorem [2j Both of these results require proving that 
a sequence of processes converges to the continuous-state branching process (Z(t),t > 0) with 
branching mechanism 

POO 

fy(u) = au + 2ir 2 u\ogu = -cu + 2ir 2 / (e~ ux - 1 + uxlr x <n) x~ 2 dx, 

Jo 

where c is the constant defined in (|134p . We will first establish Proposition Q3 and then use this 
result to deduce Theorem [2j 



6.1 The generator of the CSBP 

Let Co([0, 00)) be the set of continuous functions / : [0, 00) — > K that vanish at infinity, endowed 
with the sup norm so that for / G Cq([0, 00)), we have 

11/11= sup \f(x)\. 

x>0 

For / G C ([0,oo)) and x G [0,oo), let T t f(x) = E[f(Z(t))\Z(0) = x]. It is well-known (see, for 
example, |23j) that (T t ,t > 0) is a Feller semigroup. The following result describes the associated 
infinitesimal generator. This result is essentially well-known. The form of the generator appeared 
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in [63], and later in [26] where a particle representation of continuous-state branching processes 
was constructed. The fact that the set £ defined below is a core for the generator was established 
for closely related families of processes in |51|, 152] . However, we give a short proof of the result 
below for completeness. 

Proposition 42. Let A be the infinitesimal generator for (Z(t),t > 0). Let £ C Co([0,oo)) be 
the set of functions of the form 

f{x) = ai e~ XlX + --- + a m e- x ™ x , (136) 
where a±, . . . , a m G R and Ai, . . . , \ m > 0. Then £ is a core for A, and for f G £, 

Afix) = x(cf'(x) + 2vr 2 {fix + y)- fix) - yl {y < l} f {x))y~ 2 dyj . (137) 
Proof. If fix) = e~ Xx , then by ([5]) and ([6]), we have 

„—Xx 

= xe- Xx ^i\), 



. . , T t f(x)-f(x) , e -^«t(A) _ e -Ax fl 

Af fx) = lim tJV y — = lim = — e ~ XUt{ - X) 



t=o 



which equals the right-hand side of (|137p . The result (|137p then follows for all / G <f by linearity. 
By the Stone- Weierstrass Theorem, £ is dense in Co([0, oo)). By ([5]), we have T t f G whenever 
/ G £. It now follows from Proposition 3.3 in Chapter 1 of [31] that £ is a core for A. □ 

6.2 Proof of Proposition [I] 

The next result is Theorem 8.2 in Chapter 4 of [31] in the present context. 



Proposition 43. Suppose the distribution o/V)v(0) converges to the distribution o/Z(0) as iV — > 
oo. T/ien £/te finite- dimensional distributions of (V]sf(t),t > 0) converge to those of (Z(t),t > 0) 
as — > oo if and only if for all j > 0, all < si < S2 < • • • < Sj < u < u + s, all bounded 
continuous functions h±, . . . , hj : [0, oo) —> R, and all f G £ , we have 



lim E 

N^oo 



f{V N iu + a)) - f(V N (u)) - [ U+S Af(V N (t)) dt) H hiiV N isi)) 



0. (138) 



In view of this result, we will aim to establish (|138j) . which will imply Proposition [TJ We will 
assume that 0<si<S2<---<Sj<u<u + s. We also define the times 

U = To < T\ < ■ ■ ■ < Tq-i = u + s, 

where r k = i fc /(logiV) 3 for all k. This means that V N ir k ) = Z N it k )/(N(log N) 2 ) for all k. We 
also assume that the function / G £ and the bounded continuous functions h%,. . . ,hj are fixed 
throughout this subsection. 

Since / is of the form given in (|136p . the norms ||/||, ||/'||, and ||/"|| are finite and thus can 
be treated as constants. If g(x) = xf{x) and d{x) = xf'(x), then ||g||, \\g'\\, and \\d\\ are likewise 
finite. Also, if we define 

h{x) = supx|/"(y)|, k(x) = supx|/(y)|, (139) 

y>x y>% 
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then it is easy to check that \\h\\ < oo and ||A;|| < oo. Finally, if y > 0, then by Taylor's Theorem 
there is a z £ [x, x + y] such that f(x + y) = f(x) + yf'(x) + \y 2 f"(z). Therefore, 



x {f(x + y) - f(x) -yf'(x))y 2 dy 



and 



It follows that 



Lemma 44. We have 



x / {f( x + y) ~ f(x))y 2 dy 



<\k(x)\ + \g(x)\. 



P/||<|c|||d||+2^(|M| + M + -||/ i || i < x. 



(140) 



E[(f(V N {T k )) -/(^Ar(Tfc_i)))l { y JV ( rfe )_y ]v ( Tfc _ 1 )< e} |j't fc _ 1 ]lG^ fe _ 1 

= / , (^(r fc _i))yiv(r fe _i)^(c + 27r 2 loge)l GjVfc _ 1 +0(^ 1 /2 ) + o(1) . (141) 



Proof. Define 



n Z N,l( t k) - Z N(h-l) rr _/^ TT , -, 

= iTTTTTT ,xn9 +vrV2e ^ W JM{ W] k <e/{*y/2e-*)Y 

i=i 



iY(logiV) 5 



Note that Sk would be equal to Sk /(N (log N) 2 ), where Sk is defined in (|118f) . if Z' N 1 (t/ c ) were 
replaced in the definition by Z]y t i(tk)- Therefore, by Lemma [TU1 



E 



Sk 



N(\ogNf 



tk-i 



lG» h _! = 0(1). 



(142) 



Thus, by Lemma l38| 

- (Vtffa) - l^^)-^^.!)^}!^*-!]^-! < CW^" 3 + o(1) . (143) 

It follows from (p^3|) that 

^D/O^fa)) - /(Vzv-(Tfc_i) + ^^^(TTfeJ-Vjv^O^l^tfc-i] 1 ^-! 

< CH/'ll^/V 3 + o(l) < Cfl^V 3 + o(l). (144) 

By Taylor's Theorem, there exists £ between Vjv(rfe_i) and Vjv(7}fc_i) + 5fc such that 

^[(/(VAT(Tfc-l) + Sfc) - /(V^(T fc _ 1 )))l { y^ (Tfc) _y Jv(rfc _ l) < £} |j r ifc _ 1 ]l Gjv fe _ 1 

= ^[(/'(^(Tfe-i))^ + /"(0^fc/ 2 ) 1 {v N (r fc )-y w (-n fe _ 1 )< e }|^ : it & _i] 1 c7 JV , fc _ 1 
= / / (lV(r fe _ 1 )) J E[5 fc l { y iV (. fc )-Viv(^- 1 )< e }l^-i] 1 Giv,^i + O^^I^JlG^J. (145) 
Lemma 1381 and (|142j) give 

i?[|5 (fc |l {yiv(Tfc) _ yiv(Tfe _ l)>£} |^ fe _ 1 ]l GjVifc _ 1 < CW% 3 + o(l). (146) 
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Note that 5 1/2 e' 3 < e l/2 by (JgSJ, and A can be chosen large enough so that g(e, A) < e, where g 
is the function from Proposition [39] Therefore, (|146p combined with Lemma [38l equation (j!42l) . 
and Proposition [39] implies 

f\V N (T k _ 1 ))E[S k l { y N{Tk) _ VN{Tk _ l) < £} \Ft k _ 1 }l GN , k _ 1 

= f\V N {T k .. x ))E[B k \F fh _ l ]l Glttk _ 1 +0(es 1 / 2 ) + o(l) 

= f'(V N (T k ^))V N (r k ^)9 S {c + 2ir 2 log e)l {GNk _ l} + O{0e l l 2 ) + o(l). (147) 
Since e~ A £~ 1/2 < e 1/2 by ([95]) and ([55]> . it follows from Lemmas [36] and EH that 

ElSllTt^lo^ < C{9e- A s- l l 2 + e' 2A ■ Oe^e 1 ' 2 ) + o(l) < COe 1 / 2 + o(l). (148) 
The result follows from ([LTD . ([LIS]) . (fWD . and ([LTSD . □ 
Lemma 45. M^e /ioue 
E [{f(VN(r k )) - /(Viv(r fc _i)))l { y iv(r;;) _y Jv(Tft _ l)>e} |j r i fc _ 1 ]l Gjv ^ 1 

/oo 
(/(^v(T fc _i) + y) - f(V N {T k ^)))y- 2 dy + OiOe 1 ' 2 ) + o(l). 

Proof. By Proposition [5T] with r = e, 

^[/(^ / Ar(r fc _i))l { y Jv(Tfc) _ VW ( Tfc _ l)>e} |J r j ; fc _ 1 ]l GjVifc _ 1 

= /(VAr(r fe _ 1 ))P(V A r(r fc ) - Vfrfa-i) > e| ^..Jl^,^ 

= /(^-i^M^-i) • ^ 1 Gn ^ +O(06e- 5 / 2 ) +o(l) 

/oo 
/(^v(r fc -i))y- 2 dy + O(0fe- 5 / 2 ) + o(l). (149) 

To simplify notation, assume that e~ l is an integer. Then 

E [f( V N{rk))^{V N (r k )-V N (T k _ 1 )>e}\J : 't k _ 1 }'i-G N)k _ 1 

oo 

= ^2 ^[/(^( r fe)) 1 {m£ 2 <VW(rO-V A r(r fc _ 1 )<(m+l)e 2 }l- ;r ifc-i] 1 G' J v, fc _ 1 
m=e~ 1 
e- 3 -l 

= ]T f( £2m + V N (T k ^))P{me 2 < V N {r k ) - V N (n-x) <(m + l)e 2 ]J c i fe _ 1 )lc? J v ife _ 1 

m=e~ 1 

+ ^[(/(^v(r fc ))-/( e 2 m + F JV (r fe _ 1 )))l 

m=e~ 1 

+ E[f(V N (T k ))l {VN(Tk) _ VN{Tk _ i)>£ -i } \T tk _ 1 ]l GNJi _ 1 . (150) 

Denote the three terms on the right-hand side of (|150p by Ti, and T3. Proposition |4"T1 gives 

|T 2 | < £ 2 ||/ / ||P(^(r fc ) - V N (n-i) > e^t^la^ 

< C9eV N (T k ^)l GNk ^ + OiOde- 1 ' 2 ) + o(l) < C^e 1 / 2 + fe" 1 / 2 ) + o(l) (151) 
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and 

|T 3 | < \\f\\P(V N (r k ) - V N (r k ^) > e^l-F^Jle*,^ 

< Ce£V N (T k ^)l GN ^ + 0(95e- 5 / 2 ) + o(l) < COie 1 / 2 + fe~ 5 / 2 ) + o(l). (152) 

By Proposition 1411 and the fact that 

1/1 1 



e 2 \m m + lj e'^mim + l) 1 

we have 

P(me 2 < V N (r k ) - V N (r k ^) < (m + l)e 2 \ J^Jlc^ 

= 2n^sV E (^) 0(gfe -5/ 2) + o(1) . 

e 2 m(m + 1) 

Adding up at most e -3 error terms of order 65e~ 5 ^ 2 to get a single error term of order 65e~ ll l 2 , 
we get 



r>e 2 (m+l) 



c >>£ (m+lj 

= 27r 2 esV N (r k . 1 )l GN ^ 1 f^rn + V N {r k ^)) y~ 2 dy + O(0fe- n / 2 ) + o(l). 

, i/ £ 2 m 

(153) 

Because an error of at most ||/'||e 2 is made when replacing f(e 2 m + Vjv^jfc-i)) by f(V]y(T k _i) + y) 
with £ 2 m < y < e 2 (m + 1), we get 

f{V N {T k „i) + y)y~ 2 dy 



re (m+l) rc 

J] f(e 2 m + V N (T k ^)) / y- 2 dy- / 

_i J e 2 m Je 

<c( T e 2 y- 2 dy+ / < f ,-. 



Combining this with (|153p gives 

/(tfrfa-i) + y)y- 2 + O^e 1 ' 2 ) + 0{65e- 11 ' 2 ) + oil). (154) 

Since we have chosen 5 < e 7 < e 6 by (|95p . the lemma now follows by summing (|151 j) . (|152p . and 
p54p and subtracting ([149|) from the result. □ 

Note that 

poo fl 

f'f^\„.-2 i , _ w/ \ / -1 



V 1 {y<i}f( x )v dy = f'(x)J y dy = -f'(x) log e. 
Therefore, for every e > one can write 

A/(a;) = A 1 /(x) + A 2 /(x), 



where 



Mf{x) = x((c + 2ir 2 loge)f'(x) + 2vr 2 j* (f(x + y) - f(x) - yf'(x))y- 2 dy^j (155) 



and 



A 2 f(x) 

Lemma 46. On Gjvfc-i; we have 



x 2<k z 



{f(x + y)-f(x))y- 2 dy). 



E 



L {\VM(t)-V N (r k ^)\>^} 



dt 



Tk-l 



< C6e 2 + o(l). 



Proof. Since 9 < 6* 1 / 4 , and since 5 < e 5/2 by {22]), it follows from (fT02l) that 9e A e~ 1/2 < e 2 . 
Therefore, by Proposition [16] and Markov's Inequality, on Gjv.fc-i, 

P(R k > Ol-Ft^ ) < C6e A e- 1 ' 2 + o(l) < Ce 2 + o(l). 

(^v,i(ifc) - Z jV (t fe _ 1 ))/(7V(log iV) 2 ) = V N (r k ) - Vfrfa-i) on G Ntk -i D = 0} and 4e~ A / 4 < e 2 
by {(95]) and dSHJ), it follows from Corollary Q3] that 



P(|^v(r fe ) - V N {r k -{)\ > e 2 ]^) < Ce 2 + Cfle-^V 1 / 2 + o(l) < Ce 2 + o(l). 



(156) 



We claim that (|156p also holds with r k replaced by any t such that T k -\ < t < r k . Applying 
Corollary 1 1 3 1 req uires specifying five parameters: u, s, e, A, and 9. To establish the claim, we apply 
Corollary [T3l with new parameters u = i&_i/(log -/V) 3 , s = s, e = e, A = A, and 9 = (t — r k -i)/s. 
Note that 9 < 9, so the conditions (|32p ~(|35 p continue to hold with the new parameters. Also, 
using the new parameters, we get to = u(log N) 3 = tj.-\ and t\ = {u + 0s)(log N) 3 = t(logN) 3 . 
It thus follows from Corollary 1131 that 

P(\V N {t) - V N (n-i)\ > e'l^-J < Ce 2 + Ch-^ 2 ^ 2 + o(l) < Ce 2 + o(l). 

Here the constant C does not depend upon the choice of t. The absolute value of the o(l) can 
be bounded above by B^(t), where B^(t) < 1 for all N and t, and limjv-»oo B^(t) = for every 
fixed t. Thus, by Fubini's Theorem and the Dominated Convergence Theorem, 



E 



Tk-l 



l {\VN(t)-V N (T k -i)\>e*} 



dt 



< 



Ce 2 + B N {t) dt < C9e 2 + o(l) 



Tk-l 



as claimed. 

Lemma 47. We have 



□ 



E 



T k 



Aif(V N (t)) dt 



Tk-l 



/' (VArfa-i))V/vfo_i)0*(c + 2vr 2 log e)l Gw *_ a + 0{9e 1 ' 2 ) + o(l). 
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Proof. If < y < e, then 



f(V N (t) +y) = f(V N (t)) + yf(V N (t)) + -f"(Qy 2 



for some £ y satisfying VKr(i) < £ y < Vjv(i) + £. Therefore, 

"Oe-1 



VivW / (f(V N (t) + y)~ f{V N {t)) - yf(V N (t)))y- 2 dy dt 



o 



Tk 



Tk-1 



V N (t)[ / ^f"(tiy)dy)dt 







< 9s sup sup ^iv(i)|/"(^)| 

te[T k _ L ,T k ] ze[v N (t),v N (t)+e] 1 

< CeOs, 



(157) 



where the last inequality follows from the fact that \\h\\ < oo, where h is defined in (I139|) . 
Equations (j!55j) and ()157|) give 



E 



Aif(V N (t)) dt 



Tk-1 



(c + 2tt 2 log£)£ 



V N {t)f{V N {t))dt 



Tk-1 



tk-i 



l GN ,k-i+O{0s) 



(158) 



Recall that d(x) = xf'(x) for x > 0. Therefore, 

"Tfe 



^v(t)/'(^v(t)) d* 



/'(^(Tfe-l))^^-!)^^,.! + S 



d(Vjv(t)) - ^(rjfc-i)) 



tk-1 



1 G 



JV,fc-l 1 

(159) 



The absolute value of the second term on the right-hand side of (|159p is at most 

<"T k 



2 d WE 



V {\V N (t)-V N (r k ^)\>^} 



dt 



Tk-1 



^t k -l 



l GN ,k-i+0s\\d'\\e 2 



which is at most C9e 2 + o(l) by Lemma [ 

Therefore, the result follows from (I158P and (1159p . since e| loge| < e 1//2 for sufficiently small 
e. □ 



Lemma 48. We have 

fTk 



E 



A 2 f(V N (t)) dt 



Tk-1 



tk-i 



1g 



JV,*:-1 



2vr 2 ^^(r fc _ 1 )l GiVfe _ 1 / (/(Vat^O + y) - f{V N (T k ^)))y~ 2 dy + 0{9e) + o(l). 
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Proof. For y > 0, define the function g y (x) = xf(x + y). Note that sup^xj \\gy\\ < oo and 
su Pj/>o Wd'yW < 00 • We have 



7* 



As/OM*)) eft 



T~k-1 



E 



2vr 2 



VW(t) y (/(Viv(t) +y)- f(VN(t)))y- 2 dy dt 

/oo 
(/Ofrfa-i) + y) - /(^v(r fc _i)))y- 2 



(»y(Vy(t))-fld(Viv(t)) 
- &/0M r fc-l)) + go{VN{Tk-\)))y' 2 dy dt 



1g, 



(160) 



The absolute value of the second term on the right-hand side of (|160j) is at most 



2tt 2 s 



2 sup 11^11+211,7011 )E 
£ \ y>o 



T~k-1 



1 {\V N (t)-V N (T k . 1 )\>e^} dt 



^-Gn.k—x 



+ 27r 2 ^e(sup|| 5 ;|| + || 5o || 
Vy>0 



using that J £ °° y 2 dy = e . By LemmalHJ this expression is at most C0e+o(l), which, combined 
with (|160p . implies the result. □ 

Proof of Proposition Ul Recall that we need to establish ()138p . For 1 < k < define 



Jk = f(V N (T k )) - /(Vjvfa-l)) 



Tk 



Af(V N {t)) dt. 



Tk-l 



Then 



fc=0 



/u+s " 
Af(V N {t))dt = Y J Jk- (161) 
k=i 

Let B N)0 = G C N0 , and for 1 < k < 9" 1 , let B Njk = G Njk -i D G° Nh . Then G N {ef = L> 9 k ~J Q B N:k and 

e- 1 

l-P(G N (e))=J2 p ( B N,k)- 

Now 

e j n h i( v N^i))] = ^ [( e j * f 1 ^,,-! + e ) ri ^(^v( Si )) 

, , j £ — 1 -J. £ — 1 j 

E^iG^jn^c^^)) + e 4( e ^W,n^(^)) 



fc=i 7 i=i 



fc-l 



(162) 
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For < £ < 9~ l - 1, 



E * 

k=e+i 



f(v N (u + s))- f(v N { n )) 



u+s 



Af{V N {t)) dt 



< 2||/|| + a\\Af\\. 



Therefore, the absolute value of the second term on the right-hand side of (|162p is at most 



nilM) (2||/|| + E P(B N ,t)<C(l-P(G N (e))), 



i=l 



using (|140p . To bound the first term on the right-hand side of (|162p . note that by conditioning 



E 



k=l 



i=l 



E^^-ijIIwc 8 *)) =E E JkiG^UHVNisi)) 

k=l L i= i 

*± r/ i \ 



/c=i 



i=l 



By Lemmas [Ml H3 EH and [Ml 



\E[J k \F tk _ 1 ]l GN , k _ 1 \<C6s 1 l 2 + o{l) 



for all fc. Therefore, 



E 



(E^^-On^^^))! ^ (nn^iOfE^o^i^-i] 1 ^-!!] 

^fc=l ' t=l - 1 M=l ' ^fc=l 

< Ce 1/2 + o(l). 



It follows that 



E 



k=l ' i=l 



< Ce 1 / 2 + C(l - P(G N (e))) + o(l). 



In view of (|161 j) and Proposition [23l equation (|138p now follows by letting N — > oo and then 
letting e — > 0. □ 

6.3 The number of particles 

Because the value of Z^{t) approximately determines the number of particles a short time after 
time t, the fact that the number of particles converges to a continuous-state branching process 
follows rather simply from Proposition [TJ 

Proof of Theorem® In view of Proposition [IJ it suffices to show that for any fixed t > 0, we 
have 

L -M N ({\ogNft)-V N (t) 



2ttN 



0. 



(163) 
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Let 7 > be arbitrary. Set u = and s = t. By Proposition [23] we can choose e G (0, 7) 
sufficiently small that 

supf limsup(l-P(Gj V (e))) > ) < J, (164) 

where the supremum is taken over all 9 such that E N. Proposition 01] implies that for 
sufficiently small 0, 

P(|VKKt) - Viv(t(l - > 7) < C6e-^ 2 + (1 - P(G N (e)) + o(l). (165) 

It follows from (|164p and (| 165j) that for sufficiently small and sufficiently large N, 

P(\V N (t) - V N {t{\ - 9))\ < 7) > 1 - 7. (166) 

Let M' N ((log N) s t) denote the number of particles at time (log N) 3 t whose ancestor at time u 
is in (0,L) for all (log Nf(t(l - 6)) <u< (log N) 3 t. By Proposition [16] and (fT64"|) , for sufficiently 
small 9 > and sufficiently large N, 



P{M N ((logNft) = M' N ((logNft)) > 1 - 7. 
By {HD and the fact that 1 - fi 2 /2 - n 2 /2L 2 = 0, 



(167) 



E[M' N ((logN)h)\P {logNmi _ e)) ] 
Now 



2N(logN) 2 V N (t(l - 9))(1 + o(l)) 
L 



L 



dy. 



e'W sin 



vry\ 
L 



dy 





7T 
7T 



7ry 
L 



I'D 



dy + 



ny 
L 



Try 
L 



dy 



Try 
— < 
L 



dy 



+ 



J e-^^dy\+0(e^ L ) 



^°'i^) +0(e ~ ML) = 2T (1+o(1)) - 



It follows that 



vriV(logiV) 2 l^(l-#))(l + (l)) 



E[M' N ((\ogN)H)\P {logN y Ht{1 _ e)) ] - L , 

= 2irNV N (t(l - 9))(1 + o(l)). 

Therefore, for sufficiently large N, 
-M' N ((logN) 3 t 



P 



E 



Pi 



(logAT)3(t(l-fl)) 



V N (t(l - 6)) 



< 7 > 1 - 7. 



(168) 



2irN 

By Proposition I14| we have 

Var(M^((logiV) 3 t)|J- (logiV) 3 (t(1 _ e)) ) < C9e- 1 ' 2 N 2 (l + o{\)) 

on an event defined in the same manner as Grjv,fc— 1 but with (log ./V) 3 (i(l — 9)) playing the role 
of tjfc_j. Combining this result with (j!64|) and the conditional form of Chebyshev's Inequality, we 
get for sufficiently small 9 and sufficiently large N, 



P 



M' N ((logN)H) 



2irN 



E 



M' N ((logN)H) 



2irN 



Pt 



(logAT)3(t(l-0)) 



< 7 > 1 - 7. 



(169) 
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If now follows from (|166p . (|167p . (|168p . and (|169p that for sufficiently large N, we have 



P 



1 ;M N ((logNf)-V N (t) 



< 37 > 1 - 47. 



2vriV 

The result (TT63D follows. □ 

7 Convergence to the Bolthausen-Sznitman coalescent 

In this section, we prove Theorem [3j The strategy will be to show that a sequence of processes 
that describe the genealogy of branching Brownian motion converges to a flow of bridges defined 
in [8], which is known to be dual to the Bolthausen-Sznitman coalescent. 

7.1 The flow of bridges 

Consider the continuous state branching process of Proposition [T] and Theorem [2] with branching 
mechanism \P(u) = au + 2ir 2 ulogu. Recall from [7j that we can define this as a two-parameter 
process (Z(t,x),t >0,x> 0), where t is the time parameter and x is the initial population size. 
Also recall from [7J that we can associate with this continuous-state branching process a flow of 
subordinators. On some probability space, there exists a process (5( s, *)(x),0 < s < t,x > 0) such 
that: 

• For every < s < t the process 

S (s,t) = ( S (sA( x ),x > 0) is a subordinator with Laplace 

exponent u t - s . 

• For every integer k > 2 and every < ii < . . . < t k , the subordinators S( tl > t2 \ S^- 1 '^ 
are independent, and 

• The processes (Z(t,x),t > 0,x > 0) and (S^°'^(x),t > 0, x > 0) have the same finite- 
dimensional marginals. 

Here S^{x) can be understood as the descendants in the population at time t of the first x 
individuals in the population at time s. 

Suppose that we start with the initial population Z(0) = z. For each s < t, we can define the 
renormalized process (B St t(x),0 < x < 1) by 

B Sjt (x) = S^ixS^iz^/S^iz). 

It is easily seen that B St t is a bridge, which we define as in [8] to be a nondecreasing, [0, l]-valued 
stochastic process (B(r),0 < r < 1) with exchangeable increments and right-continuous paths 
such that B(0) = and 5(1) = 1. 

It follows from ([7]) that when \l/(it) = au + 27r 2 tilogti, the subordinator S 1 ^'^ is a stable 
subordinator with index e - 27r (*- s ). Consequently, letting i? Si t denote the range of B s j, the lengths 
of the disjoint open intervals whose union is [0, 1] \ R Sy t are independent of S^°' s \z) and have the 
Poisson-Dirichlet distribution with parameters (e~ 27T ^~ s \ 0). See |60| for a definition and further 
discussion of the two-parameter Poisson-Dirichlet distribution and its connections with stable 
subordinators. It now follows (see Example 2 in [8]) that (B St t{x), 0<s<t,0<x<l)isa flow 
of bridges, which is a collection (B s j,0 < s < t) of bridges such that if Id denotes the identity 
function from [0, 1] to itself, then 
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• For every s < t < u, we have B S)U = Bt )U o B s j- 

• The law of B s j only depends on t — s. 

• If si < S2 < ■ ■ ■ < s n , then the bridges B S1;S2 , . . . , B Sn _ 1>Sn are independent. 

• Bop = Id and Bqj — > Id as t — > in probability, in the sense of Skorohod's topology. 

Note that we are using a different convention for the time parameters than in [5J. The bridge 
B s j defined here would be called B_ t ,-s in [8]. 
If B is a bridge, define, for u E [0, 1], 

B-\u) = inf{s e [0, 1] : B(a) > «}. (170) 

If s < t < u, then = £?~+ o B^ u . Given independent random variables Ui, . . . , U n with the 
uniform distribution on [0, 1], we can define tt(B) to be the partition of {1, . . . , n} such that i and 
j are in the same block of ir(B) if and only if B~ 1 (C/,;) = S -1 (J7j). Now, given a flow of bridges 
(B s t, < s < t) and independent uniform random variables U\, . . . , f7 n , we can fix a time i > 
and consider the partition-valued process (n(s),0 < s < t) defined by II(s) = Tr(B t - St t)- The 
main result of Bertoin and Le Gall [8] establishes that this process is a so-called exchangeable 
coalescent process and that there is in fact a one-to-one correspondence between flows of bridges 
and exchangeable coalescent processes. In the example above, in which the flow of bridges is 
defined from a continuous-state branching process with ^(u) = au + 27r 2 ulogu, the process 
{^{Bt- s /2-K,t)i^ ^ s < 27rt) is the Bolthausen-Sznitman coalescent run for time 2nt (see, e.g., 
Example 2 in [8]). 



7.2 Flows describing the genealogy of branching Brownian motion 

To represent the genealogy of branching Brownian motion, we now introduce a sequence of 
discrete versions of these flows of bridges. We fix K E N and the times = to < t\ < . . . < tx- 
For < i < j < K we will define a process (B^ t ,(s), < s < 1). 

We consider the branching Brownian motion Xjy at the successive times tj(log iV) 3 . We assign 
labels to the particles at these times, and denote by Uj.j the label of the ith largest particle at 
time tj(log N) 3 , that is, the particle in position Xi t N(tj (log -^V) 3 ). We first define a collection 
of independent random variables (vij,i > 0, < j < K) having the uniform distribution on 
[0, 1]. For i < Mjy(O), we define u^o = Vj,o- That is, the individuals at time zero are labeled by 
independent uniform random variables. For j > 1, the Uij are sequences of length j + 1 which 
are defined inductively by saying that Ui t j = (u p ^j_i,Vij), where u p ^j_i is the label of the 
particle at time i,-_i(log iV) 3 from which the ith particle at time ^-(logiV) 3 has descended. That 
is, we concatenate Vij with the label of the ancestor of the ith particle to obtain the label of the 
ith particle. The particles at time tj(log iV) 3 can now be ordered using the lexicographical order 
of their labels. We denote by Xj.j the position of the ith individual in this lexicographical order 
at time t,-(log iV) 3 . 

We now assign weights to the individuals. For < j < K and 1 < i < M^(tj (log iV) 3 ), define 

y 77 tt 1 vv , '" J ' sm (^jA l {XlJ < L} if < j < K - 1 
w(i,j) = l ZN^logNy) \ L J 

, M N ( tj (logN)3) ^3=K. 
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That is, the particles are weighted proportional to their contribution to the sum in Q, except 
at time tx(}og N) 3 when all particles are weighted equally. We use these weights because we 
will later sample particles uniformly at time £^(log A r ) 3 , but the number of descendants that a 
particle at time ti(log iV) 3 has at time ^(logiV) 3 will be roughly proportional to the weight that 
it has been assigned. Also define Ai(j,k) to be the set of descendants at time t&(log iV) 3 of the 
ith individual at time t,-(log iV) 3 . More precisely, A{(j,k) is the set of indices I such that the 
individual at position at time ifc(logiV) 3 is descended from the individual in position Xij 
at time tj(logiV) 3 . We are now ready to define the discrete bridges. First, for < y < 1, and 
< j < K, define 

L j (y)=maxlleN:J2w(i,j) < y\, (171) 
^ i=i > 

with the convention that the maximum of the empty set is 0. We think of Lj{y) as being 
approximately the yth quantile of the population at time ij(log iV) 3 when individuals are weighted 
as above and ordered according to their labels. Then for < y < 1 and < j < k < K , let 

Lj{y) 

<**(y)=E E 

i=l m£Ai(j,k) 

Note that these discrete bridges B^ tk are not exactly bridges in the sense defined above; for 
example, their increments are not exactly exchangeable because there are only finitely many 
particles at time tj. However, we will show in Lemmas [52] and [Ml below that these discrete 
bridges converge to the bridges Bt j} t k - 



Lemma 49. If < i < j < k < K , then B^ t = B£ t o B? t . and {B^ t )~ l = (B N ^ 1 



ti,tk tjtk ti,tj \ U,tk> 



O 



(B^ tk ) 1 , where the inverse functions are defined as in (170). 
Proof. For < y < 1, 

Li(y) 

B ltM=Y, E w ^ k )- 

i=\ m£A t (i,k) 

Note that m £ Ai(i, k) for some £ < Li(y) if and only if m £ Ap(j, k) for some i < Lj(B^ t ,(y)). 
Therefore, 

<t k {v)= E E Hm,k) = Bg tk (B» tj (y)). 

= 1 m£A e (j,k) 



That is, BP tl = B¥ t oBF,. Also, 



(B^ tk )- 1 (y)=mf{ S :B^ tk ( S )>y} 

= mi{s : B» tk (B^(s)) > y} 

= m({s:B» tj (s)>(B» tk )- 1 (y)} 

= (B^)-H(Bl tk )- l (y)), 

which implies that (B&J" 1 = (Bg,.)' 1 o (B&tJ" 1 . □ 
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7.3 Convergence of one bridge 



Let (-B Sj t,0 < s < t) be the flow of bridges defined above from the continuous-state branching 
process with branching mechanism ty(u) = au -\-2tt 2 u log u. We will now show that for 1 < i < K, 
the sequence of discrete bridges (BQ t (u),0 < u < 1) converges to (£0,^(1*), < u < 1) in the 
sense of finite-dimensional distributions. The first step is the following extension of Proposition 

m 

Lemma 50. Assume that the initial population is subdivided into m possibly random subgroups 
Si, ... , S m , and that given the initial positions of the particles, they evolve according to branching 
Brownian motion killed at 0. Assume that Y}v(0)/(iV(log N) 3 ) converges to zero in probability. 
Let Zi^{t) denote the contribution to the sum in (0|) from particles descended from one of the 
particles that is in Si at time zero, and let Mi wit) denote the number of particles at time t 
descended from one of the particles that is in Si at time zero. Assume that the initial joint 
distribution of 

converges as N —> 00 to some probability measure p on [0, oo) m . Then the finite- dimensional 
distributions of the m- dimensional vector-valued processes 



Z hN (t(logNY 
JV(log TV) 2 



, t > > and 



M ijN (t(logNf 
2nN 



,t>0 



each converge as N — >• 00 to the finite- dimensional distributions of {(Zi(t))i^ 1 ,t > 0} , where 
(Zj(0))^L 1 has distribution p, and conditional on (Zj(0))^L 1; each Zi evolves independently as a 
continuous- state branching process with branching mechanism ty(u) = au + 2iru\ogu. 

Proof. While this is in principle a simple extension of Proposition [TJ some care is needed in 
the proof because the components of the process are not independent but only conditionally 
independent given the initial configuration. To ease notation, we only show here the proof of the 
one-dimensional marginal convergence (which is all that is needed later), as the general result 
is conceptually identical but more cumbersome. Thus, let t > 0, and fix arbitrary bounded and 
continuous test functions fi, - ■ ■ ,f m '■ [0,oo) — > R. By Skorohod's Representation Theorem, we 
may assume that all the branching Brownian motions Xn are constructed on the same probability 
space in such a way that the expression in (|172p converges almost surely to (Zi(0))^] =l having 
joint distribution p. 

For i = 1, ... ,771, let X^n denote the branching Brownian motion obtained by considering 
only the descendants of the particles in Si. Let T = a(Xi ) jsr(0),i = l,...,m, N = 1,2,...) 
be the filtration generated by all the processes at time zero for all subgroups. Let also Q = 



a(Zi(0), . . . , Z m (0)). Note that the random variables Zi^(t(log N) 3 ) 
conditionally independent given J 7 . Therefore, 



,Z m>N (t(logNY 



arc 



E 



W< 

i=l 



Z hN (t(\ogN) 3 ) 
N(logN) 2 



E 



E 



E 



i=l 



.i=i 



z iiN {t(\o g N) 3 : 

N(logN) 2 

Zj, N (t(logN) 3 ) 
N(logN) 2 



7 



(173) 
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By Proposition HJ for 1 < i < m we have that E[fi(Z iiN (t(log N) 3 )/(N(log A^) 2 )) | J^"] converges 
almost surely to the random variable E z .^[fi(Z(t))], where E x [fi(Z(t))] denotes the expected 
value for the continuous-state branching process started from the value Z(0) = x. The application 
of Proposition [1] is justified here because the condition that Yjv(0)/(iV(log ^) 3 ) converges in 
probability to zero is satisfied for the entire process, and hence the analogous condition is satisfied 
for each of the m components. We may rewrite this random variable as 

E Zi{0) [f i (Z(t))} = E[f i (Z i (t))\g]. 

Since all random variables on the right-hand side of (|173p are bounded, we deduce by the Domi- 
nated Convergence Theorem that 



lim E 



Zi, N (t(logNf 
N(\og N) 2 



E 



i=i 



E 



li=l 



since the random variables are conditionally independent given Q. This completes the 

proof of convergence for the processes Z^. The proof for the processes M^n is identical, except 
that we invoke Theorem [2] instead of Proposition [TJ □ 

Before proving the convergence of bridges, we establish the following lemma, which states 
that at a typical time t, no single particle makes too large a contribution to Z^(t). 



Lemma 51. Let 

m N (s) 



max 

l<i<M N {s{\og TV) 3 ) 



e ^X ltN {s{\ogNf) gin 



TTX itN (s(logNf) 



L 



Then for all s > 0, we have (s)/(N (log N) 2 ) — > in probability as N — )■ oo. 

Proof. Suppose (xn)n=i is a sequence such that e fMXN /(N(log N) 3 ) — > as N 
wn = L — xat, we have wn — > oo as N — > oo. Therefore, 



**« sin (^L) = e^~^ sin ( ^ 



ire^ L 

< —j— • w N e-^ WN = o{N{\ogNY 



oo. Letting 



(174) 



Observe that Yjv(s(log N) 3 )/(N(log N) 3 ) converges in probability to zero, which is true by as- 
sumption when s = and by Proposition [23] when s > 0. Therefore, given any subsequence 
(Ni)£li> there is a further subsequence (N jk )f =1 such that Y N . (s(log N jk f)/(N jk (log N jk f^ 



converges to zero almost surely. It follows from (|174p that (s)/(Nj k (logNj k ) ) converges to 
zero almost surely, which implies the result. □ 



Lemma 52. Assume the hypotheses of Theorem^ hold. Recall that = to < t\ < • • • < t^. Let 
m > 1 and let = uq < u\ < . . . < u m = 1. Then for each fixed i, with 1 < i < K , we have 



where 



(J3fo(«i))$Li =► (5 , ti K))f =1 , 

denotes convergence in distribution as N 



oo. 
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Proof. It suffices to prove the joint convergence of the increments (B^ t .(uj 
Define Lq as in (|17ip . and for 1 < j < m, let 

Sj = {Lo(uj-i) + 1, Lo(uj-i) + 2, . . . , L (uj)} 



N („, \ r>N (., 



))j=V 



be the subset of particles in the population at time zero associated with the quantiles in [uj—i,Uj). 
Note that the Sj are disjoint and divide the population at time zero into m subgroups. We treat 
the positions of the particles in these m subgroups as m random starting configurations, to which 
we will apply Lemma [50l 

For 1 < j < m, define the process (Zj^(t),t > 0) as in Lemma 1501 We claim that the 
distribution of 

^(logiV) 2 J J=1 (1?5) 

converges as N — > oo to some probability measure p on [0,oo) m . Here p has the distribution of 
(6jX)'jL 1 , where Sj = Uj — Uj—\ for 1 < j < m and X has distribution v. To check that this 
convergence holds, note that 

|%jv(0) - $iZjv(0)| < 2771^(0), 

where m7v(0) is defined as in Lemma [51] and the error term 2mjv(0) comes from the fact that 
Ylk=i e^ Xk '° sm(irxk } o/L) increases discontinuously with j. In view of Lemma IST] the convergence 
of the distribution of (|1 75j) to p follows by Slutsky's theorem (see Corollary 3.3 in Chapter 3 of 
|31j ) and Proposition [TJ Therefore, the hypotheses of Lemma l50l are satisfied. 
Assume for now that i < K — 1. By Lemma [50l 



Z^ N {tj{\ogNf 
iV(logiV) 2 



where {{Zj(t))'jL 1 ,t > 0} is defined as in Lemma [50l Thus for any a > 0, 



Z j}N (ti(logN) 3 ) 



Z N (U(logNf)VaN(logNf 



3=1 



Zj (ti 



(176) 



3=1 



Choose 7 > 0, and let a be such that P(Z[tj) < a) < 7, where (Z(t),t > 0) is a continuous- 
state branching process with branching mechanism ^ and initial distribution u , which is possible 
because ^({0}) = and (Z(t),t > 0) never goes extinct. Thus, by Proposition [1] we have for N 
large enough, 

P(Z N (U(logNf) < aN(\ogN) 2 ) < 2 7 . 
Now fix fx, ... , f m , some arbitrary bounded and continuous test functions on [0, 1] and let M = 



■ \\fm\\- Thus, we have 



E 



3=1 



E 



3=1 



Zj {ti 



< 



E 



3=1 



Z hN {ti{\ogNf 



aN (\ogNY VZ^v^OogiV) 3 ) 
+ MP(Z N {ti{\ogNf) < aN{\ogNf 



E 



3=1 

MP{Z(ti) < a). 



Zj (ti 
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Taking the limsup of both sides, we find that the first term in the right-hand side of the above 
inequality converges to by (|176p . and the second and third terms are respectively smaller than 
2M7 and M7. Since 7 > is arbitrary, and since 

( zjiu) r 

has the same distribution as (BQ !ti (uj)— Bo^U]-!))™^, this finishes the proof when 1 < i < K—l. 

The proof when i = K is the same, except Zj y ^(tii\og iV) 3 )/(iV(log iV) 2 ) needs to be replaced 
throughout the argument by Mj^ (U (log N) 3 )/(27rN), where the processes (Mj^(t),t > 0) are 
defined as in Lemma loDl □ 



7.4 Joint convergence of bridges 

In this subsection we extend the convergence obtained in Lemma [52] to the joint convergence of 
the finite-dimensional distributions of several bridges. We begin by establishing a result about 
the convergence of the distribution of a single bridge, conditional on the branching Brownian 
motion up to the starting point of the bridge. 

Lemma 53. Assume the hypotheses of Theorem^ hold. Recall that = to < t\ < ■ ■ ■ < tx- Let 
m > 1, and let = u$ < u\ < • • • < u m . Let f : [0, l] m+1 — ^ M be bounded and continuous. For 
< % < K — 1, we have 

E[f(Bl ti+1 (u ), . . .,B» ti+1 (u m ))\T tiilogN)3 ] ^ p E[f(B tiM+1 (u ), B ti>u+1 {u m ))], 

where —> p denotes convergence in probability as N — > 00. 

Proof. Let (Z(t),t > 0) be a continuous-state branching process with branching mechanism \I/ 
and initial distribution v. By Proposition [H 

Z N ( tl (logNf) 
iV(logiV) 2 ^ Z ^- 

Also, we have Y/v(tj(log iV) 3 )/(iV(log iV) 3 ) — > p by assumption if i = and by Proposition [23] 
if z > 1. Therefore, by Skorohod's Representation Theorem, the branching Brownian motion 
processes (X^,N > 1) can be constructed on a single probability space in such a way that 
Z7v(ii(logA0 3 )/(iV(logA0 2 ) -> Ziti) a.s. and Y N (t^log N) 3 ) / (N (log N) 3 ) -> a.s. Further- 
more, it can be arranged that the processes Xn evolve independently of one another after time 
t; (log AO 3 . 

Let Tt = ct(Xn(s), N > 1, < s < t) be the cr-field generated by all the information up to time 
t by all processes. By the Markov property, conditional on ^(logN) 3 ! the process X^ evolves after 
time tj(logiV) 3 like a branching Brownian motion with absorption whose initial configuration is 
that of A0v(ti(logA0 3 ). Therefore, we can apply Lemma l52l with — ti playing the role of ti 
in Lemma [52l to get that on this probability space 

E [f( B U,u +1 {u ), ■ ■ • ,^i I+1 ( u m))l-^ ! (io g 7V)3] ->■ E[f(B 0tU+1 ^ u (u ), . . . ,B 0l i, +1 ^,(«m))] a.s. 
The result follows because the bridges Bq^+x-u and Bt it t i+1 have the same law. □ 
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Lemma 54. Assume the hypotheses of Theorem^ hold. Recall that = to < ti < . . . < and 
let = uq < u\ < • • ■ < u m < 1. Then 



( B U,t 1+1 ( u j))o<i<K~l => {B U ,U +1 {Uj)) <i<K-l, 
l<j'<m l<j<m 



(177) 



where the bridges Bt it t i+1 , < i < K — 1, are independent. 

Proof. We proceed by induction. The convergence of (B^ tl {uj))i<j< m to {Bt ,ti{ u j))i<j<m is a 
consequence of Lemma [52j Thus assume that the convergence (|177p holds for < i < k — 1 with 
2 < k < K — I. Let /i, . . . , fk ■ [0, l] m+1 — y R be bounded continuous functions. By Proposition 
[TJ we know that 



(^(i! (log AO 3 ), • • • , Z N (t k (logN) 3 )) => (Zfa), . . . , Z(t fe )), 



(178) 



where (Z(t),t > 0) is a continuous-state branching process with branching mechanism \t and 
initial distribution v. Let T% = a(X^(s), N > 1,0 < s < t) be the u-field generated by the 
information up to time t. To simplify notation, we write fif = (B^ t (uj))i<j<m and (3i = 
(Bt i ,t i+1 {uj)) 1 jL 1 . Since fi£ is conditionally independent of fii , . . . gi ven -^^(lagiV) 3 ; 



E 



Li=l 

Lemma [S3] states that 
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fc-i 



i=l 



n/i(/3f) P A(^)|^ fe _ l( lo: 



gA T)3 



(179) 



(180) 



where — )- p denotes convergence in probability as N — > oo. Using the identity of real numbers 

x y — xy = x (y — y) + y(x' — x) 

in (nzaD with x' = UiZlhwfi y' = E[f k (^)\F tk _ i{logN)3 }, x = u.i=lmm ^ v 

B[fk(Pk)], and then taking the expectation, we get 



E 



.1=1 



f[E[f i (p i )]=E 



i=l 



k-1 



Um 



i=l 



+ E[f k ((3 k )} [ E 



E 



k-1 



fkW, 



N \ 



i=l 



t fc _i(logJV) 3 
k-1 

U E lh(fc 

i=l 



E[f k (h)\ 



The first term on the right-hand side converges to by the Dominated Convergence Theorem 
and (|180p since fi,---,fk are bounded, and the second term converges to by the induction 
hypothesis. This finishes the proof of Lemma [331 □ 



7.5 Tightness 

Our goal in this subsection is to prove the following tightness result. 

Lemma 55. Assume the hypotheses of Theorem [5] hold. For < i < K — 1, the sequence of 
random discrete bridges (B^ t (u), < u < 1) is a tight sequence with respect to the Skorohod 
topology. 
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Proof. For 5 > and a function i? : [0, 1] —> [0, 1], define 

w'(B,5) = inf max sup \B(x) — B(y)\, 

{xj} 3 x,y£[xj,Xj + i) 

where the infimum is taken over all subdivisions {xj} of [0, 1] with = xq < x\ < ■ ■ ■ < x m = 1 
and min(xj+i — Xj) > 5. It suffices to show (see Chapter 13 of [11]) that for all e > 0, there exists 
5 > such that 

limsup P(w'(B% ti+i ,d) > e) < e. (181) 

N— >oo 

Assume for now that i < K — 2. To prove (I18ip . we need to show that two jumps do not occur 
very close to one another. Let e > 0. Let (Z(t),t > 0) be a continuous-state branching process 
with branching mechanism ^ and initial distribution v. Since ^({0}) = 0, the continuous-state 
branching process does not explode or go extinct, so we can fix < a < 1 such that 

P(a < Z(t i+1 ) < l/a) > 1 - e/4. 

Let A(a,N) be the event that aN(logN) 2 < Z N (t i+ i(log N) 3 ) < a^N^ogN) 2 . By Proposition 
[H we can choose N Q so that for all N > N , we have P(A(a, N)) > 1 - e/2. 
For < x < 1, let 

Z",t i+ M = Z N {t i+l {\ogNf)Bl u+i {x) 

Li(x) 

= E E 

We now define our subdivision {xj}. Let xo = 0, and for j > 1 such that x.,_i < 1, let 

x, = 1 A min{x > : Zg Ml (x) - ^ ti+1 (x;-l) > aeiV(log iV) 2 }. 

Since P(y4(a,iV)) > 1 — e/2, and since this subdivision ensures that \Bg t (x) — B^ >t (y)\ < e 
for all x,y £ [xj,Xj + i) on the event A(a,N), it remains only to show that there is a 5 > such 
that 

limsup P(A(a,N) n {min(x J - - x.,-_i) < 5}) < e/2. 

AT-J-oo i 

Let 13, be the event that Xj < 1 — (5. On the event A(a, iV), there can be at most 1/ea 2 values 
of Xj less than 1. Also, on the event Dj, we have xj — Xj-i < S if and only if Z^ t (xj-i + 5) — 
Zt? )t (xj—i) > aeiV(log TV) 2 . Therefore, it suffices to show that there exists <5 > such that 

limsup - < ti+1 (l -S)> aeN (log N) 2 ) < e/4 

and for all < i < (1/ea 2 ) - 1, 

limsup P(A n {Z^ixj-x + 5)- Z^{x S - X ) > aeiV(logiV) 2 }) < e 2 a 2 /4. 

In view of Lemma [5T| both of these statements follow from an application of Proposition [H in 
which the distribution of 5Z(ti) plays the role of v. 

If i = K — 1, the proof proceeds in the same way, except that instead of working with Zj^ t . +i , 
we define M$L t (x) = M^(tK^og N) 3 )B tK _ lt t K (x). The subdivision is defined by xq = and, 
for j > 1, 

= 1 A min{x > : M»_ utK (x) - M^Jx^) > 2vraeiV}. 
The proof concludes with an application of Theorem [2] rather than Proposition [TJ □ 
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Because the tightness of each sequence (B^ t (u), < u < 1) implies the joint tightness of 
the K sequences of bridges, Lemmas [531 and [55] combine to yield the following corollary. 

Corollary 56. The sequence of processes ({B^ tl (u), B^ t2 (u), . . . , Bg_ utK (u)),0 < u<l) con- 
verges in the Skorohod topology to {{B toM (u), B tlM (u), . . . , B tK _ lt t K (u)), < u < 1). 

7.6 Coalescence 

Let D be the set of functions / : [0, 1] — > R that are right continuous and have left limits. Let p 
denote the Skorohod metric on D. Let A denote the set of functions A : [0, 1] — > [0, 1] that are 
continuous and strictly increasing and satisfy A(0) = and A(l) = 1. Recall (see Chapter 12 of 
[TT] ) that if /, /i, /2, • • • are functions in D, then lim n _ s . OC) p(f n , /) = if and only if there exists 
a sequence of functions (X n )^ =1 in A such that 

lim sup \f n (K(t))- f(t)\=0 (182) 

"^°°0<K1 

and 

lim sup \X n (t)-t\ = 0. (183) 

n ^°°0<i<l 

The lemma below is similar to Lemma 1 of [8] but differs in that we do not require the 
processes Bn to have exchangeable increments. 

Lemma 57. Suppose 6, 61,62,... are functions from [0,1] to [0,1] that are nondecreasing and 
right continuous and have left limits at every point other than 0. Suppose limjv->oo /o(6/v, 6) = 0, 
where p denotes the Skorohod metric. Suppose (xn)n=i an ^ (un)n=i are sequences in [0, 1] such 
that xn — > x and yN — >■ V as N — >• 00. Suppose x and y are not in the closure of the range 
ofb. Then for sufficiently large N we have 6^ 1 (xjv) = 6^ 1 (?Mr) if and only ifb^ 1 (x) = 6~ 1 (y). 
Furthermore, 

lim b7}(x N ) = b-^x). (184) 

N— >oo 

Proof. Because x is not in the closure of the range of 6, there exists some maximal open interval 
(u, v ) with u < x < v such that (u, v) does not intersect the range of 6. For sufficiently small 5, we 
have u + 25 < x < v + 25, which implies u + 5 < xn < v — 5 for sufficiently large N. By condition 
(|182|) applied to 6^v and 6, for sufficiently large N the interval (u + 5, v — 5) does not intersect the 
range of b]y. Therefore, there exists 7jv such that 6^(7^) > v — 5 and 6^(7^—) < u + 5. Then 
bJ^(xN) = 7tv for sufficiently large JV. Also, there is a sequence of functions (\n)n=i m ^ such 
that Aat(6 _1 (x)) = 7/v for sufficiently large N by (|182p and therefore lini7v_s.oo In = 6 _1 (x) by 
(fT83]) . The result (fTM]) follows. 

Suppose b~ 1 (x) = b~ 1 {y). Because 6 is right continuous with left limits, we have u < y < v. 
Arguing as above, we have 6^- 1 (t/at) = 77V for sufficiently large N, and thus 6^- 1 (x/v) = b^ l 1 (yN) 
for sufficiently large N. Alternatively suppose 6 _1 (x) 7^ b~ l (y). We may assume without loss of 
generality that x < y. Then y > v, and there is some open interval (r, s) with v < r < y < s 
such that (r, s) does not intersect the range of 6. As above, there exists 5 > and £zv such that 
for sufficiently large N, we have 6at(^/v) > s — 5, b^i^N— ) <r + 5, and 6^ r 1 (?/7v) = £/v- Therefore, 
&jV ( x n) (un) for sufficiently large N, and the lemma follows. □ 

Proof of Theorem Fix times = to < tl < ' " < t-K = t. By Corollary [56] and Skoro- 
hod's Representation Theorem, we may work on a probability space on which the sequence of 
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discrete bridges ((B^ t (u),B^ t2 (u),...,B^ t (u)),0 < u < 1) converges almost surely to 
((BtQ^iu), B tl j 2 (u), . . . ,B tl( _ 1: t K {u)),Q < u < 1). Note that in this setting, almost sure conver- 
gence means that p(B^ t . +i , B t .^ i+1 ) — > as N — > oo for i = 0, 1, . . . , K — 1, where p denotes the 
Skorohod metric. 

Fix a positive integer n, and let Ui, . . . ,U n be independent random variables having the 
uniform distribution on [0,1]. For < i < K — 1, define the partition ir(B^ t ) = n(B^ tK ) 
to be the partition of {1,... ,n} such that i and j are in the same block of the partition if 
and only if (B% ttK )-\Uj = Likewise, define 7r(£?^) = vr(S tijtx ) to be the 

partition of {1, ... ,n} such that i and j are in the same block of the partition if and only if 
Bj~. t (Ui) = Bj~ t (Uj). It follows from the definition of the processes B^ tR that i and j are in the 
same block of the partition if and only if the individuals who are in positions [L r jM/ v r(t^(log iV) 3 )] 
and [[7jM/v(iR-(log -/V) 3 )] in the lexicographical order at time ift"(log iV) 3 are descended from the 
same ancestor at time ti(logN) 5 . As a result, we have the equality in distribution 

{AK-i,t K )' ■ ■ ■ ' «,t K )) =d ( n iv(2^ - t K -i)), • • • , njv(27r(t - t ))) , (185) 

where II tv is the process defined in Theorem El We note that the sampling scheme here using 
the random variables Ui, . . . , U n corresponds to sampling with replacement from the individuals 
at time t^-(log iV) 3 , but the difference between sampling with and without replacement is unim- 
portant because the probability of sampling the same individual twice tends to zero as N — > oo. 
We claim that for < % < K — 1, almost surely 

*(B» tK ) = 7r(B tutK ) (186) 

for sufficiently large N. Because we know the process (^(B t _ s / 27T) t)^ < s < 2-7rt) is the 
Bolthausen-Sznitman coalescent run for time i, this claim in combination with (|185p will im- 
ply Theorem [3l 

We now prove (I186P by backwards induction. Since the lengths of the intervals of the comple- 
ment of the range of Bt K _ x $ K have a Poisson-Dirichlet distribution and thus sum to 1 (see, e.g., 
Proposition 2 in [60J), the closure of the range of B tK _ 1) t K has Lebesgue measure zero almost 
surely. Therefore, almost surely Ui,...,U n are not in the closure of the range of Bt K _ lt t K - It 
follows from Lemma [57] that n(Bj^ K _ i t ) = 7r(-E>t K _i,t K ) for sufficiently large N almost surely. 
Furthermore, 

^jBg^rHUj) = Bg_ utK <JJ s ) 

almost surely for j = 1, ... ,n. Also, by Lemma 2 of [8], the random variables B7_^ t K (Uj) each 
have the uniform distribution on [0, 1]. 

For the induction step, suppose that for some i = 2, . . . , K — 1, the following hold: 

• We have ^lim (B^ tR )~ 1 (Uj) = B^ tK (Uj) almost surely for j = 1, . . . ,n. 

• The random variables B^\ (Uj) each have the uniform distribution on [0, 1]. 

Now (B^yHUj) = (B^ uU )-H(Bl tK )-HUj)) by Lemma [Ml and likewise B^JUj) = 
^U-i t (Bp t K (Uj))- Because the random variables B^ tK (Uj) each have the uniform distribution 
on [0,1] and are independent of -Bt i _ li i i , almost surely none of these random variables is in 
the closure of the range of B ti _ 1) t i - Since also (Bj? tK )~ l (Uj) — > B^. t (Uj) almost surely for 
j = l,...,n, Lemma [571 implies that n(B^_ lt ) = 7r(B til ^ K ) for sufficiently large N almost 
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surely. Furthermore, (B^_ lt ) 1 (Uj) — > B t . t K (Uj) almost surely for j = 1, . . . ,n. By Lemma 
2 of [8], the random variables BjT t K {Uj) each have the uniform distribution on [0, 1]. The claim 
(|186p now follows by induction. □ 
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